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FIELD OF THE INVENTION 

The invention relates to nucleic acid and amino acid sequences involved in 
polyhydroxyalkanoate biosynthesis, and more specifically, to polyhydroxyalkanoate 
biosynthesis sequences isolated from Bacillus megaterium. In particular, nucleic acid 
sequences phaP, phaQ, phaR, phaB, phaC, and their encoded amino acid sequences are 
disclosed. 

BACKGROUND OF THE INVENTION 

This patent application is related to U.S. Provisional Application Serial Number 
60/115,092, filed on January 7, 1999. The government may own partial rights to the 
present invention pursuant to grant number MCB 9604450 from the National Science 
Foundation. 

Polyhydroxyalkanoic acids (PHA) are a class of aliphatic polyesters that 

accumulate in inclusion-bodies in many bacteria and archaea (2, 41). Their physiological 

role in the cell is that of carbon and energy reserves, and as a sink for reducing power. 

The most studied PHA have repeating subunits of: -[0-CH(R)(CH 2 ) x CO]-, where the 

most common form is polyhydroxybutyrate (PHB), with R = CH 3 and x = 1 (45). The 

PHA biosynthetic pathway has been determined for Alcaligenes eutrophus (17, 18, 44). 

In this organism two molecules of acetyl-Coenzyme A (CoA) are condensed by P- 

ketothiolase (PhaA), followed by a stereo-specific reduction catalyzed by an NADPH 

dependent acetoacetyl-CoA reductase (PhaB) to produce the monomer D-(-)-P- 

hydroxybutyryl-CoA, which is polymerized by PHA synthase (PhaC). These 3 pha genes 

are coded on the phaCAB operon, which is speculated to be constitutively expressed, but 

PHA is not constitutively synthesized. Alternative pathways for synthesis of the 

monomer in other organisms have been suggested, most notably in the Pseudomonas 

species where the side chain, R, is longer than CH 3 and its composition is influenced by 

carbon substrates in the growth medium (7, 45). In addition to A. eutrophus, phaC has 

been cloned from more than twenty different bacteria (26, 43). Other genes associated 
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with PHA synthesis, phaA, phaB, phaZ (PHA depolymerase) and genes for inclusion- 
body associated proteins and other low molecular weight proteins of unknown function, 
have also been cloned from some of these bacteria, in many cases by virtue of the fact 
that they are clustered with phaC. 

PHA inclusion-bodies are 0.2 to 0.5 |um in diameter, but their structural details are 
largely unknown. They were described originally for some species of Bacillus (6, 8, 15, 
30, 47) and later for many more bacteria including Pseudomonas, Alcaligenes and 
Rhodococcus (5, 1 1, 12, 25, 42). Those from Bacillus megaterium were shown to contain 
97.7% PHA, 1.87% protein and 0.46% lipid with protein and lipid forming an outer layer 
(15). More recent reports show the presence of a 14 kDa protein (GA14) on PHA 
inclusion-bodies of R. ruber (36, 37), and a 24 kDa protein (GA24) with similarities to 
GA14 on the inclusion-bodies of A. eutrophus (48). These proteins are not essential for 
PHA accumulation but have been shown to influence the size of PHA inclusion-bodies 
and the rate of PHA accumulation (37, 48). GA14 and GA24 have been named "phasins" 
due to some similarities with oleosins, which are proteins on the surface of oil bodies in 
plant seeds (21). Granule associated proteins are wide-spread in PHA accumulating 
bacteria (49). 

The pattern of PHA inclusion-body growth and proliferation throughout the 
growth cycle of Bacillus megaterium has been described (32). 

There exists a need for additional nucleic acid and amino acid sequences useful 
for the production of polymers in biological systems. 



This invention is the result of a study of PHA inclusion-body associated proteins 
from Bacillus megaterium and the cloning and analysis of their coding region. The 
transcription starts were identified, the functional expression of several of the sequences 
was confirmed in Escherichia coli and in PHA negative mutants of Bacillus megaterium 



SUMMARY OF THE INVENTION 
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and Pseudomonas putida, and PhaP and PhaC were localized to PHA inclusion-bodies 
throughout growth. 

A nucleic acid fragment encoding proteins involved in polyhydroxyalkanoate 
biosynthesis was isolated from Bacillus megaterium. Nine nucleic acid sequences and 
their encoded amino acid sequences are disclosed. Sequences encoding PhaB and PhaC 
display not insignificant percent identity and similarity to known acetoacetyl-CoA 
reductase and polyhydroxyalkanoate synthase proteins, while sequences encoding PhaP, 
PhaQ, and PhaR do not display significant similarity to known sequences. YkoY is 
similar to known toxic anion resistance proteins; YkoZ is similar to known RNA 
polymerase sigma factors; YkrM is similar to known Na + -transporting ATP synthase 
proteins; and SspD matches the known B. megaterium spore specific DNA binding 
protein. 

While several PHA related sequences were expressed in two organisms, it is 
envisioned that the sequences may be expressed in a wide array of organisms, and that the 
nucleic acid sequences themselves may be modified to change the sequence and 
properties of the encoded proteins. 

DESCRIPTION OF THE FIGURES 

The following figures form part of the present specification and are included to 
further demonstrate certain aspects of the present invention. The invention may be better 
understood by reference to one or more of these drawings in combination with the 
detailed description of specific embodiments presented herein. 

Figure 1. PHA inclusion-body associated proteins. SDS-polyacrylamide gel 
electrophoresis of proteins released from purified PHA inclusion-bodies. Lane 1, 
molecular weight markers in kDa, 14, 18, 29, 43, 68 and 97. Lane 2, proteins from 
inclusion-bodies of cells harvested at late exponential growth phase. Lane 3, same as 
lane 2 except this part of the gel was stained following 45 minutes transfer of proteins 
(seen in lane 2) to PVDF membrane. The bands were visualized by staining with 
Coomassie Blue. 
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Figure 2 (A): The pha sequence cluster and flanking sequences. Map of cloned 
fragment in pGMlO carrying the pha genes (stripped arrows), intergenic regions (igrs) 
and flanking genes (thick black arrows) from Bacillus megaterium. The thin arrows 
indicate the locations and directions of transcripts; P, indicates promoter positions. 
pGMl, pGM6, pGM9 and pGM7 indicate the cloned DNA fragments in these plasmids 
(Table 1). Probes used to identify and clone the pha cluster are indicated by thick short 
lines under pGMl; n2 and n5 are degenerate probes; bmp and bmc are homologous 
probes to the ends of the pGMl fragment. Ruler of sequence in base pairs is for Bacillus 
megaterium and B. subtilis. Map of yko, sspD and ykr region in the B. subtilis genome; 
genes with homology to those of Bacillus megaterium in this region are indicted by thick 
black arrows; non-homologous genes are indicated by thick gray arrows. Gene 
annotations are horizontal over each gene symbol. Relevant restriction enzyme sites are 
vertical. 

Figure 2 (B): Putative promoter regions for phaRBC, -Q, -P and sspD. Curved 
arrows indicate transcription start (+1), -10 and -35 nucleotides. The closest resemblance 
to known -10 and -35 promoter sequences are in lower case letters below putative pha 
promoter sequences. Immediately downstream from the PhaP stop codon, the previously 
described (9) sspD putative promoter is boxed, and putative hairpin structure is 
underlined. 

Figure 2 (C): Mapping of the 5* ends of the phaRBC, -Q and -P transcripts (see 
Example 1 1). Lanes G, A, T and C show the dideoxy sequencing ladders obtained with 
the same primers used in primer extension analysis; nucleotide sequences are 
complementary to the transcripts. Lane P is the primer extension product. Lane M is a 
DNA molecular size marker measured in nucleotides. The primer extension product is 
indicated by an arrowhead and the 5' end of the transcript within the sequence is indicated 
by a star. Only regions of the gel containing extension product bands are shown. 

Figure 3: Pairwise alignment of PhaC from Bacillus megaterium (this study) and 
P. oleovorans (SWISS-PROT accession no. P26494); amino acid identities are shown in 
black. The Clustal method with PAM250 residue weight table was used. 
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Figure 4. pha::gfp fusion plasmids and precursors. Only relevant restriction sites 
are shown. Annotations are as Figure 2. In all fusions the c-terminus excluding the stop 
codon, of either phaC or phaP, is fused to the gfp gene by the pGFPuv polylinker. For 
more details, see Table 1 . 

Figure 5 (A): Time-course analysis of Bacillus megaterium (pGM16.2) by phase 
contrast, green fluorescence, light image, and PHA fluorescence. Time (hours) are hours 
post-inoculation as indicated. 

Figure 5 (B): Growth curve for Figure 5 (A); arrowheads indicate a decrease in 
PhaP::GFP fluorescence. 

Figure 5 (C): Bacillus megaterium (pGM16.2) sampled at 2 days post-inoculation. 
Top image is phase contrast, bottom image is GFP fluorescence. 

Figure 5 (D): Bacillus megaterium (pGM13) sampled at 2 days post-inoculation, 
left -whole cells, right -lysed cells. Top image is phase contrast, bottom image is GFP 
fluorescence. 

Figure 5 (E): Bacillus megaterium (pGM13C) sampled at 9 hours post- 
inoculation. Top image is phase contrast, bottom image is GFP fluorescence. 

Figure 5 (F): Bacillus megaterium (pHPS9) showed no fluorescence at any time 
point. Top image is phase contrast, bottom image is GFP fluorescence. 

Figure 6: Hydrophilicity plot of PhaP protein. 

Figure 7: Hydrophilicity plot of PhaQ protein. 

Figure 8: Hydrophilicity plot of PhaR protein. 

Figure 9: Pairwise alignment of PhaC from Bacillus megaterium (this study) and 
T. violacea (SWISS-PROT accession no. P45366); amino acid identities are indicated by 
a star (*), and amino acid similarities are indicated by a period (.) below the sequences. 
The ClustalW method with PAM350 residue weight table was used. 

Figure 10: Proposed biosynthetic pathway for the preparation of C8 copolymers. 
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DESCRIPTION OF THE SEQUENCE LISTINGS 



The following sequence listings form part of the present specification and are 
included to further demonstrate certain aspects of the present invention. The invention 
may be better understood by reference to one or more of these sequences in combination 
with the detailed description of specific embodiments presented herein. 



SEQ ID NO 


Description 


1 


Bacillus megaterium 7,916 bp fragment 


2 


phaP nucleic acid sequence, 2566-3075 reverse complement 


3 


PhaP amino acid sequence, 170 amino acids 


4 


phaQ nucleic acid sequence, 3247-3684 reverse complement 


5 


PhaQ amino acid sequence, 146 amino acids 


6 


phaR nucleic acid sequence, 4170-4673 


7 


PhaR amino acid sequence, 168 amino acids 


8 


phaB nucleic acid sequence, 4758-5498 


9 


PhaB amino acid sequence, 247 amino acids 


10 


phaC nucleic acid sequence, 5578-6663 


11 


PhaC amino acid sequence, 362 amino acids 


12 


oligonucleotide probe n2, 39 bases 


13 


oligonucleotide probe n5, 30 bases 


14 


oligonucleotide probe bmp, 19 bases 


15 


oligonucleotide probe bmc, 22 bases 


16 


oligonucleotide primer for phaP transcription start, 20 bases 


17 


oligonucleotide primer for phaQ transcription start, 19 bases 


18 


oligonucleotide primer for phaRBC transcription start, 19 bases 


19 


N-terminal amino acid sequence of 14 kDa protein 


20 


N-terminal amino acid sequence of 20 kDa protein 


21 


N-terminal amino acid sequence of 41 kDa protein 


22 


ykoY nucleic acid sequence, 277-1089 


23 


YkoY amino acid sequence, 271 amino acids 


24 


ykoZ nucleic acid sequence, 1460-2167 


25 


YkoZ amino acid sequence, 236 amino acids 


26 


ykrM nucleic acid sequence, 6959-7916 (partial) 


27 


YkrM amino acid sequence, 319 amino acids (partial) 


28 


sspD nucleic acid sequence, 2419-2225 reverse complement 


29 


SspD amino acid sequence, 65 amino acids 
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DEFINITIONS 



The following definitions are provided in order to aid those skilled in the art in 
understanding the detailed description of the present invention. 

"C-terminal region" refers to the region of a peptide, polypeptide, or protein chain 
from the middle thereof to the end that carries the amino acid having a free a carboxyl 
group (the C-terminus). 

"CoA" refers to coenzyme A. 

The phrases "coding sequence", "open reading frame", and "structural sequence" 
refer to the region of continuous sequential nucleic acid triplets encoding a protein, 
polypeptide, or peptide sequence. 

The term "encoding DNA" or "encoding nucleic acid" refers to chromosomal 
nucleic acid, plasmid nucleic acid, cDNA, or synthetic nucleic acid which codes on 
expression for any of the proteins or fusion proteins discussed herein. 

The term "genome" as it applies to bacteria encompasses both the chromosome 
and plasmids within a bacterial host cell. Encoding nucleic acids of the present invention 
introduced into bacterial host cells can therefore be either chromosomally-integrated or 
plasmid-localized. The term "genome" as it applies to plant cells encompasses not only 
chromosomal DNA found within the nucleus, but organelle DNA found within 
subcellular components of the cell. Nucleic acids of the present invention introduced into 
plant cells can therefore be either chromosomally-integrated or organelle-localized. 

"Identity" refers to the degree of similarity between two nucleic acid or protein 
sequences. An alignment of the two sequences is performed by a suitable computer 
program. A widely used and accepted computer program for performing sequence 
alignments is CLUSTALW vl.6 (Thompson, et al. Nucl Acids Res., 22: 4673-4680, 
1994). The number of matching bases or amino acids is divided by the total number of 
bases or amino acids, and multiplied by 100 to obtain a percent identity. For example, if 
two 580 base pair sequences had 145 matched bases, they would be 25 percent identical. 
If the two compared sequences are of different lengths, the number of matches is divided 
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by the shorter of the two lengths. For example, if there were 100 matched amino acids 
between 200 and a 400 amino acid proteins, they are 50 percent identical with respect to 
the shorter sequence. If the shorter sequence is less than 150 bases or 50 amino acids in 
length, the number of matches are divided by 150 (for nucleic acids) or 50 (for proteins); 
and multiplied by 100 to obtain a percent identity. 

The terms "microbe" or "microorganism" refer to algae, bacteria, fungi, and 
protozoa. 

"N-terminal region" refers to the region of a peptide, polypeptide, or protein chain 
from the amino acid having a free amino group to the middle of the chain. 

"Nucleic acid" refers to ribonucleic acid (RNA) and deoxyribonucleic acid 

(DNA). 

A "nucleic acid segment" is a nucleic acid molecule that has been isolated free of 
total genomic DNA of a particular species, or that has been synthesized. Included with 
the term "nucleic acid segment" are DNA segments, recombinant vectors, plasmids, 
cosmids, phagemids, phage, viruses, etcetera. 

"Overexpression" refers to the expression of a polypeptide or protein encoded by 
a DNA introduced into a host cell, wherein said polypeptide or protein is either not 
normally present in the host cell, or wherein said polypeptide or protein is present in said 
host cell at a higher level than that normally expressed from the endogenous gene 
encoding said polypeptide or protein. 

The term "plastid" refers to the class of plant cell organelles that includes 
amyloplasts, chloroplasts, chromoplasts, elaioplasts, eoplasts, etioplasts, leucoplasts, and 
proplastids. These organelles are self-replicating, and contain what is commonly referred 
to as the "chloroplast genome," a circular DNA molecule that ranges in size from about 
120 to about 217 kb, depending upon the plant species, and which usually contains an 
inverted repeat region (Fosket, Plant growth and Development, Academic Press, Inc., San 
Diego, CA, p. 132, 1994). 
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"Polyadenylation signal" or "poly A signal" refers to a nucleic acid sequence 
located 3' to a coding region that directs the addition of adenylate nucleotides to the 3' 
end of the mRNA transcribed from the coding region. 

The term "polyhydroxyalkanoate (or PHA) synthase" refers to enzymes that 
convert hydroxyacyl-CoAs to polyhydroxyalkanoates and free CoA. 

The term "promoter" or "promoter region" refers to a nucleic acid sequence, 
usually found upstream (5') to a coding sequence, that controls expression of the coding 
sequence by controlling production of messenger RNA (mRNA) by providing the 
recognition site for RNA polymerase and/or other factors necessary for start of 
transcription at the correct site. As contemplated herein, a promoter or promoter region 
includes variations of promoters derived by means of ligation to various regulatory 
sequences, random or controlled mutagenesis, and addition or duplication of enhancer 
sequences. The promoter region disclosed herein, and biologically functional equivalents 
thereof, are responsible for driving the transcription of coding sequences under their 
control when introduced into a host as part of a suitable recombinant vector, as 
demonstrated by its ability to produce mRNA. 

"Regeneration" refers to the process of growing a plant from a plant cell (e.g., 
plant protoplast or explant). 

"Transformation" refers to a process of introducing an exogenous nucleic acid 
sequence (e.g., a vector, recombinant nucleic acid molecule) into a cell or protoplast in 
which that exogenous nucleic acid is incorporated into a chromosome or is capable of 
autonomous replication. 

A "transformed cell" is a cell whose nucleic acid has been altered by the 
introduction of an exogenous nucleic acid molecule into that cell. 

A "transformed plant" or "transgenic plant" is a plant whose nucleic acid has been 
altered by the introduction of an exogenous nucleic acid molecule into that plant, or by 
the introduction of an exogenous nucleic acid molecule into a plant cell from which the 
plant was regenerated or derived. 
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DETAILED DESCRIPTION OF THE INVENTION 

This invention was developed in the pursuit of proteins which are associated with 
polyhydroxyalkanoate inclusion bodies, and in the pursuit of novel nucleic acid and 
amino acid sequences from the bacteria Bacillus megaterium. A 7,916 base pair nucleic 
acid fragment was isolated and sequenced (SEQ ID NO:l). This fragment was found to 
contain nine open reading frames, five of which encode proteins suspected of being 
involved in polyhydroxyalkanoate biosynthesis. 

Genomic fragment 

An embodiment of the invention is a nucleic acid segment at least about 80% 
identical to SEQ ID NO:l. More preferably, the nucleic acid segment is at least about 
82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to 
SEQ ID NO:l. Alternatively, the nucleic acid segment may be a nucleic acid segment 
that hybridizes under stringent conditions to SEQ ID NO:l, or to the complement thereof. 
The nucleic acid segment may be obtained from a natural source, may be mutagenized, 
may be genetically engineered by mutagenesis or other methods, or may be synthetic. 

The invention is further directed to nucleic acid segments, proteins, recombinant 
vectors, recombinant host cells, genetically transformed plant cells, genetically 
transformed plants, methods of preparing host cells, methods of preparing plants, fusion 
proteins, and nucleic acid segments encoding fusion proteins. 

phaP and PhaP 

A nucleic acid segment may comprise a nucleic acid sequence encoding a 
polyhydroxyalkanoate inclusion body associated protein, wherein the nucleic acid 
sequence is selected from the group consisting of: a nucleic acid sequence at least about 
80% identical to SEQ ID NO:2; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO:2 or the complement thereof; a nucleic acid sequence encoding 
a protein at least about 80% identical to SEQ ID NO:3; and a nucleic acid sequence 
encoding a protein that is immunoreactive with an antibody prepared using SEQ ID NO: 3 
as an antigen, the antibody being immunoreactive with SEQ ID NO:3. More preferably, 
the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 
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98%, 99%, 99.5%, or 100% identical to SEQ ID NO:2. The nucleic acid segment may be 
obtained from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:3. 

An isolated polyhydroxyalkanoate inclusion body associated protein may 
comprise an amino acid sequence selected from the group consisting of: an amino acid 
sequence at least about 80% identical to SEQ ID NO:3; and an amino acid sequence that 
is immunoreactive with an antibody prepared using SEQ ID NO:3 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:3. The protein is preferably at least 
about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical 
toSEQIDNO:3 

A recombinant vector may comprise in the 5' to 3' direction: a) a promoter that 
directs transcription of a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate inclusion body associated protein; b) a structural nucleic acid 
sequence encoding a polyhydroxyalkanoate inclusion body associated protein; wherein 
the structural nucleic acid sequence is selected from the group consisting of: a nucleic 
acid sequence at least about 80%) identical to SEQ ID NO:2; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO:2 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:3; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO: 3 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:3; and c) a 3' transcription terminator. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:2. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:3. The promoter may generally be any 
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promoter, and more preferably is a tissue selective or tissue specific promoter. The 
promoter may be constitutive or inducible. The promoter may be a viral promoter. The 
promoter may be a CMV35S, enhanced CMV35S, an FMV35S, a Lesquerella 
hydroxylase, or a 7S conglycinin promoter. 

A recombinant host cell may comprise a nucleic acid segment encoding a 
polyhydroxyalkanoate inclusion body associated protein, wherein the nucleic acid 
segment is selected from the group consisting of: a nucleic acid sequence at least about 
80% identical to SEQ ID NO:2; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO:2 or the complement thereof; a nucleic acid sequence encoding 
a protein at least about 80% identical to SEQ ID NO:3; and a nucleic acid sequence 
encoding a protein that is immunoreactive with an antibody prepared using SEQ ID NO:3 
as an antigen, the antibody being immunoreactive with SEQ ID NO:3. More preferably, 
the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 
98%, 99%, 99.5%, or 100% identical to SEQ ID NO:2. The nucleic acid segment may be 
obtained from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:3. The host cell may generally be any host cell, 
and preferably is a bacterial, fungal, mammalian, or plant cell. The bacterial cell is 
preferably an Escherichia coli, Bacillus, Pseudomonas, or Ralstonia eutropha cell. The 
fungal cell is preferably a Saccharomyces cerevisiae or Schizosaccharomyces pombe cell. 
The plant cell is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants 
such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa cell. 

A genetically transformed plant cell may comprise in the 5' to 3' direction: a) a 
promoter that directs transcription of a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate inclusion body associated protein; b) a structural nucleic acid 
sequence encoding a polyhydroxyalkanoate inclusion body associated protein; wherein 
the structural nucleic acid sequence is selected from the group consisting of: a nucleic 



H: 365095(7TPJ01!.DOC) 



-13 of 112- 



Cannon Application 



acid sequence at least about 80% identical to SEQ ID NO:2; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO:2 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:3; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO:3 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:3; c) a 3' transcription terminator; and d) a 3' polyadenylation signal 
sequence that directs the addition of polyadenylate nucleotides to the 3' end of RNA 
transcribed from the structural nucleic acid sequence. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:2. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:3. The plant may generally be any plant, and 
more preferably a monocot, dicot, or conifer. The plant is preferably a tobacco, wheat, 
potato, Arabidopsis, and high oil seed plants such as corn, soybean, canola, oil seed rape, 
sugarbeet, sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa plant. 

A method of preparing host cells useful to produce a polyhydroxyalkanoate 
inclusion body associated protein may comprise a) selecting a host cell; b) transforming 
the selected host cell with a recombinant vector having a structural nucleic acid sequence 
encoding a polyhydroxyalkanoate inclusion body associated protein, wherein the 
structural nucleic acid sequence is selected from the group consisting of: a nucleic acid 
sequence at least about 80% identical to SEQ ID NO:2; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO:2 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:3; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO: 3 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:3; and c) obtaining transformed host cells. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 

-14 of 112- 

Cannon Application 

H: 365095(7TPJ01!.DOC) 



99.5%, or 100% identical to SEQ ID NO:2. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:3. The host cell may generally be any host cell, 
and preferably is a bacterial, fungal, mammalian, or plant cell. The bacterial cell is 
preferably an Escherichia coli, Bacillus, Pseudomonas, or Ralstonia eutropha cell. The 
fungal cell is preferably a Saccharomyces cerevisiae or Schizosaccharomyces pombe cell. 
The plant cell is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants 
such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa cell. 

A method of preparing plants useful to produce a polyhydroxyalkanoate inclusion 
body associated protein may comprise a) selecting a host plant cell; b) transforming the 
selected host plant cell with a recombinant vector having a structural nucleic acid 
sequence encoding a polyhydroxyalkanoate inclusion body associated protein, wherein 
the structural nucleic acid sequence is selected from the group consisting of: a nucleic 
acid sequence at least about 80% identical to SEQ ID NO:2; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO:2 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:3; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO: 3 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:3; c) obtaining transformed host plant cells; and d) regenerating the 
transformed host plant cells. More preferably, the nucleic acid sequence is at least about 
82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to 
SEQ ID NO:2. The nucleic acid segment may be obtained from a natural source, may be 
mutagenized, may be genetically engineered by mutagenesis or other methods, or may be 
synthetic. The nucleic acid sequence preferably encodes a protein at least about 82%, 
84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID 
NO:3. The plant (and plant cell) may generally be any plant, and more preferably a 
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monocot, dicot, or conifer. The plant is preferably a tobacco, wheat, potato, Arabidopsis, 
and high oil seed plants such as corn, soybean, canola, oil seed rape, sugarbeet, 
sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa plant. 

The invention also relates to fusion proteins. A fusion protein may comprise a 
green fluorescent protein subunit; and a polyhydroxyalkanoate inclusion body associated 
protein subunit; wherein the polyhydroxyalkanoate inclusion body associated protein 
subunit comprises an amino acid sequence selected from the group consisting of: an 
amino acid sequence at least about 80% identical to SEQ ID NO:3; and an amino acid 
sequence that is immunoreactive with an antibody prepared using SEQ ID NO:3 as an 
antigen, the antibody being immunoreactive with SEQ ID NO:3. The 
polyhydroxyalkanoate inclusion body associated protein subunit is preferably at least 
about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical 
toSEQIDNO:3 

A nucleic acid segment encoding a fusion protein may comprise a nucleic acid 
sequence encoding a green fluorescent protein subunit; and a nucleic acid sequence 
encoding a polyhydroxyalkanoate inclusion body associated protein subunit; wherein the 
nucleic acid sequence encoding a polyhydroxyalkanoate inclusion body associated 
protein subunit is selected from the group consisting of: a nucleic acid sequence at least 
about 80% identical to SEQ ID NO:2; a nucleic acid sequence that hybridizes under 
stringent conditions to SEQ ID NO:2 or the complement thereof; a nucleic acid sequence 
encoding a protein at least about 80% identical to SEQ ID NO:3; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO:3 as an antigen, the antibody being immunoreactive with SEQ ID NO:3. More 
preferably, the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 
94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:2. The nucleic acid 
sequence may be obtained from a natural source, may be mutagenized, may be 
genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence preferably encodes a protein subunit at least about 82%, 84%, 86%, 
88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ IDNO:3. 
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phaO and PhaO 

A nucleic acid segment may comprise a nucleic acid sequence encoding a 
polyhydroxyalkanoate inclusion body associated protein, wherein the nucleic acid 
sequence is selected from the group consisting of: a nucleic acid sequence at least about 
80% identical to SEQ ID NO:4; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO:4 or the complement thereof; a nucleic acid sequence encoding 
a protein at least about 80% identical to SEQ ID NO:5; and a nucleic acid sequence 
encoding a protein that is immunoreactive with an antibody prepared using SEQ ID NO: 5 
as an antigen, the antibody being immunoreactive with SEQ ID NO:5. More preferably, 
the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 
98%, 99%, 99.5%, or 100% identical to SEQ ID NO:4. The nucleic acid segment may be 
obtained from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:5. 

An isolated polyhydroxyalkanoate inclusion body associated protein may 
comprise an amino acid sequence selected from the group consisting of: an amino acid 
sequence at least about 80% identical to SEQ ID NO:5; and an amino acid sequence that 
is immunoreactive with an antibody prepared using SEQ ID NO:5 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:5. The protein is preferably at least 
about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical 
to SEQ ID NO:5 

A recombinant vector may comprise in the 5' to 3' direction: a) a promoter that 
directs transcription of a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate inclusion body associated protein; b) a structural nucleic acid 
sequence encoding a polyhydroxyalkanoate inclusion body associated protein; wherein 
the structural nucleic acid sequence is selected from the group consisting of: a nucleic 
acid sequence at least about 80% identical to SEQ ID NO:4; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO:4 or the complement thereof; a 
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nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:5; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO: 5 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:5; and c) a 3' transcription terminator. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:4. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%; 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:5. The promoter may generally be any 
promoter, and more preferably is a tissue selective or tissue specific promoter. The 
promoter may be constitutive or inducible. The promoter may be a viral promoter. The 
promoter may be a CMV35S, enhanced CMV35S, an FMV35S, a Lesquerella 
hydroxylase, or a 7S conglycinin promoter. 

A recombinant host cell may comprise a nucleic acid segment encoding a 
polyhydroxyalkanoate inclusion body associated protein, wherein the nucleic acid 
segment is selected from the group consisting of: a nucleic acid sequence at least about 
80% identical to SEQ ID NO:4; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO:4 or the complement thereof; a nucleic acid sequence encoding 
a protein at least about 80% identical to SEQ ID NO:5; and a nucleic acid sequence 
encoding a protein that is immunoreactive with an antibody prepared using SEQ ID NO: 5 
as an antigen, the antibody being immunoreactive with SEQ ID NO: 5. More preferably, 
the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 
98%, 99%, 99.5%, or 100% identical to SEQ ID NO:4. The nucleic acid segment may be 
obtained from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:5. The host cell may generally be any host cell, 
and preferably is a bacterial, fungal, mammalian, or plant cell. The bacterial cell is 
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preferably an Escherichia coli, Bacillus, Pseudomonas, or Ralstonia eutropha cell. The 
fungal cell is preferably a Saccharomyces cerevisiae or Schizosaccharomyces pombe cell. 
The plant cell is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants 
such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa cell. 

A genetically transformed plant cell may comprise in the 5' to 3' direction: a) a 
promoter that directs transcription of a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate inclusion body associated protein; b) a structural nucleic acid 
sequence encoding a polyhydroxyalkanoate inclusion body associated protein; wherein 
the structural nucleic acid sequence is selected from the group consisting of: a nucleic 
acid sequence at least about 80% identical to SEQ ID NO:4; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO:4 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO: 5; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO: 5 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:5; c) a 3' transcription terminator; and d) a 3' polyadenylation signal 
sequence that directs the addition of poly adenylate nucleotides to the 3' end of RNA 
transcribed from the structural nucleic acid sequence. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:4. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:5. The plant may generally be any plant, and 
more preferably a monocot, dicot, or conifer. The plant is preferably a tobacco, wheat, 
potato, Arabidopsis, and high oil seed plants such as corn, soybean, canola, oil seed rape, 
sugarbeet, sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa plant. 

A method of preparing host cells useful to produce a polyhydroxyalkanoate 
inclusion body associated protein may comprise a) selecting a host cell; b) transforming 
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the selected host cell with a recombinant vector having a structural nucleic acid sequence 
encoding a polyhydroxyalkanoate inclusion body associated protein, wherein the 
structural nucleic acid sequence is selected from the group consisting of: a nucleic acid 
sequence at least about 80% identical to SEQ ID NO:4; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO:4 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:5; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO: 5 as an antigen, the antibody being immunoreactive with 
SEQ ID NO: 5; and c) obtaining transformed host cells. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:4. The nucleic acid segment may be obtained 
from, a natural source^ may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:5. The host cell may generally be any host cell, 
and preferably is a bacterial, fungal, mammalian, or plant cell. The bacterial cell is 
preferably an Escherichia coli, Bacillus, Pseudomonas, or Ralstonia eutropha cell. The 
fungal cell is preferably a Saccharomyces cerevisiae or Schizosaccharomyces pombe cell. 
The plant cell is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants 
such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa cell. 

A method of preparing plants useful to produce a polyhydroxyalkanoate inclusion 
body associated protein may comprise a) selecting a host plant cell; b) transforming the 
selected host plant cell with a recombinant vector having a structural nucleic acid 
sequence encoding a polyhydroxyalkanoate inclusion body associated protein, wherein 
the structural nucleic acid sequence is selected from the group consisting of: a nucleic 
acid sequence at least about 80% identical to SEQ ID NO:4; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO:4 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:5; 
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and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO: 5 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:5; c) obtaining transformed host plant cells; and d) regenerating the 
transformed host plant cells. More preferably, the nucleic acid sequence is at least about 
82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to 
SEQ ID NO:4. The nucleic acid segment may be obtained from a natural source, may be 
mutagenized, may be genetically engineered by mutagenesis or other methods, or may be 
synthetic. The nucleic acid sequence preferably encodes a protein at least about 82%, 
84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID 
NO:5. The plant (and plant cell) may generally be any plant, and more preferably a 
monocot, dicot, or conifer. The plant is preferably a tobacco, wheat, potato, Arabidopsis, 
and high oil seed plants such as corn, soybean, canola, oil seed rape, sugarbeet, 
sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa plant. 

The invention also relates to fusion proteins. A fusion protein may comprise a 
green fluorescent protein subunit; and a polyhydroxyalkanoate inclusion body associated 
protein subunit; wherein the polyhydroxyalkanoate inclusion body associated protein 
subunit comprises an amino acid sequence selected from the group consisting of: an 
amino acid sequence at least about 80% identical to SEQ ID NO:5; and an amino acid 
sequence that is immunoreactive with an antibody prepared using SEQ ID NO: 5 as an 
antigen, the antibody being immunoreactive with SEQ ID NO:5. The 
polyhydroxyalkanoate inclusion body associated protein subunit is preferably at least 
about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical 
to SEQ ID NO:5 

A nucleic acid segment encoding a fusion protein may comprise a nucleic acid 
sequence encoding a green fluorescent protein subunit; and a nucleic acid sequence 
encoding a polyhydroxyalkanoate inclusion body associated protein subunit; wherein the 
nucleic acid sequence encoding a polyhydroxyalkanoate inclusion body associated 
protein subunit is selected from the group consisting of: a nucleic acid sequence at least 
about 80% identical to SEQ ID NO:4; a nucleic acid sequence that hybridizes under 
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stringent conditions to SEQ ID NO:4 or the complement thereof; a nucleic acid sequence 
encoding a protein at least about 80% identical to SEQ ID NO:5; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO: 5 as an antigen, the antibody being immunoreactive with SEQ ID NO:5. More 
preferably, the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 
94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:4. The nucleic acid 
sequence may be obtained from a natural source, may be mutagenized, may be 
genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence preferably encodes a protein subunit at least about 82%), 84%, 86%, 
88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:5. 
vhaR and PhaR 

A nucleic acid segment may comprise a nucleic acid sequence encoding a 
polyhydroxyalkanoate inclusion body associated protein, wherein the nucleic acid 
sequence is selected from the group consisting of: a nucleic acid sequence at least about 
80% identical to SEQ ID NO:6; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO: 6 or the complement thereof; a nucleic acid sequence encoding 
a protein at least about 80% identical to SEQ ID NO:7; and a nucleic acid sequence 
encoding a protein that is immunoreactive with an antibody prepared using SEQ ID NO: 7 
as an antigen, the antibody being immunoreactive with SEQ ID NO:7. More preferably, 
the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 
98%>, 99%o, 99.5%, or 100% identical to SEQ ID NO:6. The nucleic acid segment may be 
obtained from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:7. 

An isolated polyhydroxyalkanoate inclusion body associated protein may 
comprise an amino acid sequence selected from the group consisting of: an amino acid 
sequence at least about 80% identical to SEQ ID NO:7; and an amino acid sequence that 
is immunoreactive with an antibody prepared using SEQ ID NO: 7 as an antigen, the 
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antibody being immunoreactive with SEQ ID NO:7. The protein is preferably at least 
about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical 
to SEQ ID NO:7 

A recombinant vector may comprise in the 5' to 3' direction: a) a promoter that 
directs transcription of a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate inclusion body associated protein; b) a structural nucleic acid 
sequence encoding a polyhydroxyalkanoate inclusion body associated protein; wherein 
the structural nucleic acid sequence is selected from the group consisting of: a nucleic 
acid sequence at least about 80% identical to SEQ ID NO:6; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO:6 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:7; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO: 7 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:7; and c) a 3' transcription terminator. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:6. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:7. The promoter may generally be any 
promoter, and more preferably is a tissue selective or tissue specific promoter. The 
promoter may be constitutive or inducible. The promoter may be a viral promoter. The 
promoter may be a CMV35S, enhanced CMV35S, an FMV35S, a Lesquerella 
hydroxylase, or a 7S conglycinin promoter. 

A recombinant host cell may comprise a nucleic acid segment encoding a 
polyhydroxyalkanoate inclusion body associated protein, wherein the nucleic acid 
segment is selected from the group consisting of: a nucleic acid sequence at least about 
80% identical to SEQ ID NO:6; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO: 6 or the complement thereof; a nucleic acid sequence encoding 
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a protein at least about 80% identical to SEQ ID NO:7; and a nucleic acid sequence 
encoding a protein that is immunoreactive with an antibody prepared using SEQ ID NO: 7 
as an antigen, the antibody being immunoreactive with SEQ ID NO:7. More preferably, 
the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 
98%, 99%, 99.5%, or 100% identical to SEQ ID NO:6. The nucleic acid segment may be 
obtained from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:7. The host cell may generally be any host cell, 
and preferably is a bacterial, fungal, mammalian, or plant cell. The bacterial cell is 
preferably an Escherichia coli, Bacillus, Pseudomonas, or Ralstonia eutropha cell. The 
fungal cell is preferably a Saccharomyces cerevisiae or Schizosaccharomyces pombe cell. 
The plant cell is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants 
such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa cell. 

A genetically transformed plant cell may comprise in the 5' to 3' direction: a) a 
promoter that directs transcription of a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate inclusion body associated protein; b) a structural nucleic acid 
sequence encoding a polyhydroxyalkanoate inclusion body associated protein; wherein 
the structural nucleic acid sequence is selected from the group consisting of: a nucleic 
acid sequence at least about 80% identical to SEQ ID NO:6; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO:6 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:7; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO:7 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:7; c) a 3' transcription terminator; and d) a 3' polyadenylation signal 
sequence that directs the addition of polyadenylate nucleotides to the 3' end of RNA 
transcribed from the structural nucleic acid sequence. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
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99.5%, or 100% identical to SEQ ID NO:6. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:7. The plant may generally be any plant, and 
more preferably a monocot, dicot, or conifer. The plant is preferably a tobacco, wheat, 
potato, Arabidopsis, and high oil seed plants such as corn, soybean, canola, oil seed rape, 
sugarbeet, sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa plant. 

A method of preparing host cells useful to produce a polyhydroxyalkanoate 
inclusion body associated protein may comprise a) selecting a host cell; b) transforming 
the selected host cell with a recombinant vector having a structural nucleic acid sequence 
encoding a polyhydroxyalkanoate inclusion body associated protein, wherein the 
structural nucleic acid sequence is selected from the group consisting of: a nucleic acid 
sequence at least about 80% identical to SEQ ID NO:6; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO: 6 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:7; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO: 7 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:7; and c) obtaining transformed host cells. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:6. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:7. The host cell may generally be any host cell, 
and preferably is a bacterial, fungal, mammalian, or plant cell. The bacterial cell is 
preferably an Escherichia coli, Bacillus, Pseudomonas, or Ralstonia eutropha cell. The 
fungal cell is preferably a Saccharomyces cerevisiae or Schizosaccharomyces pombe cell. 
The plant cell is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants 
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such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa cell. 

A method of preparing plants useful to produce a polyhydroxyalkanoate inclusion 
body associated protein may comprise a) selecting a host plant cell; b) transforming the 
selected host plant cell with a recombinant vector having a structural nucleic acid 
sequence encoding a polyhydroxyalkanoate inclusion body associated protein, wherein 
the structural nucleic acid sequence is selected from the group consisting of: a nucleic 
acid sequence at least about 80% identical to SEQ ID NO:6; a nucleic acid sequence that 
hybridizes under stringent conditions to SEQ ID NO: 6 or the complement thereof; a 
nucleic acid sequence encoding a protein at least about 80% identical to SEQ ID NO:7; 
and a nucleic acid sequence encoding a protein that is immunoreactive with an antibody 
prepared using SEQ ID NO: 7 as an antigen, the antibody being immunoreactive with 
SEQ ID NO:7; c) obtaining transformed host plant cells; and d) regenerating the 
transformed host plant cells. More preferably, the nucleic acid sequence is at least about 
82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to 
SEQ ID NO:6. The nucleic acid segment may be obtained from a natural source, may be 
mutagenized, may be genetically engineered by mutagenesis or other methods, or may be 
synthetic. The nucleic acid sequence preferably encodes a protein at least about 82%, 
84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID 
NO:7. The plant (and plant cell) may generally be any plant, and more preferably a 
monocot, dicot, or conifer. The plant is preferably a tobacco, wheat, potato, Arabidopsis, 
and high oil seed plants such as corn, soybean, canola, oil seed rape, sugarbeet, 
sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa plant. 

The invention also relates to fusion proteins. A fusion protein may comprise a 
green fluorescent protein subunit; and a polyhydroxyalkanoate inclusion body associated 
protein subunit; wherein the polyhydroxyalkanoate inclusion body associated protein 
subunit comprises an amino acid sequence selected from the group consisting of: an 
amino acid sequence at least about 80% identical to SEQ ID NO:7; and an amino acid 
sequence that is immunoreactive with an antibody prepared using SEQ ID NO: 7 as an 
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antigen, the antibody being immunoreactive with SEQ ID NO:7. The 
polyhydroxyalkanoate inclusion body associated protein subunit is preferably at least 
about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical 
to SEQ ID NO:7 

A nucleic acid segment encoding a fusion protein may comprise a nucleic acid 
sequence encoding a green fluorescent protein subunit; and a nucleic acid sequence 
encoding a polyhydroxyalkanoate inclusion body associated protein subunit; wherein the 
nucleic acid sequence encoding a polyhydroxyalkanoate inclusion body associated 
protein subunit is selected from the group consisting of: a nucleic acid sequence at least 
about 80% identical to SEQ ID NO:6; a nucleic acid sequence that hybridizes under 
stringent conditions to SEQ ID NO:6 or the complement thereof; a nucleic acid sequence 
encoding a protein at least about 80% identical to SEQ ID NO:7; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO:7 as an antigen, the antibody being immunoreactive with SEQ ID NO:7. More 
preferably, the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 
94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:6. The nucleic acid 
sequence may be obtained from a natural source, may be mutagenized, may be 
genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence preferably encodes a protein subunit at least about 82%, 84%, 86%, 
88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:7. 

phaB and PhaB 

A nucleic acid segment may comprise a nucleic acid sequence encoding a 3-keto- 
acyl-CoA reductase protein, wherein the nucleic acid sequence is selected from the group 
consisting of: a nucleic acid sequence at least about 80% identical to SEQ ID NO:8; a 
nucleic acid sequence that hybridizes under stringent conditions to SEQ ID NO:8 or the 
complement thereof; a nucleic acid sequence encoding a protein at least about 80% 
identical to SEQ ID NO:9; and a nucleic acid sequence encoding a protein that is 
immunoreactive with an antibody prepared using SEQ ID NO:9 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:9. More preferably, the nucleic acid 
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sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:8. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:9. 

An isolated 3-keto-acyl-CoA reductase protein may comprise an amino acid 
sequence selected from the group consisting of: an amino acid sequence at least about 
80% identical to SEQ ID NO:9; and an amino acid sequence that is immunoreactive with 
an antibody prepared using .SEQ ID NO:9 as an antigen, the antibody being 
immunoreactive with SEQ ID NO:9. The protein is preferably at least about 82%, 84%, 
86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:9 

A recombinant vector may comprise in the 5' to 3' direction: a) a promoter that 
directs transcription of a structural nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein; b) a structural nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein; wherein the structural nucleic acid sequence is selected from the group 
consisting of: a nucleic acid sequence at least about 80% identical to SEQ ID NO:8; a 
nucleic acid sequence that hybridizes under stringent conditions to SEQ ID NO: 8 or the 
complement thereof; a nucleic acid sequence encoding a protein at least about 80% 
identical to SEQ ID NO:9; and a nucleic acid sequence encoding a protein that is 
immunoreactive with an antibody prepared using SEQ ID NO:9 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:9; and c) a 3' transcription terminator. 
More preferably, the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 
92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:8. The nucleic 
acid segment may be obtained from a natural source, may be mutagenized, may be 
genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence preferably encodes a protein at least about 82%, 84%, 86%, 88%, 
90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:9. The 
promoter may generally be any promoter, and more preferably is a tissue selective or 
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tissue specific promoter. The promoter may be constitutive or inducible. The promoter 
may be a viral promoter. The promoter may be a CMV35S, enhanced CMV35S, an 
FMV35S, a Lesquerella hydroxylase, or a 7S conglycinin promoter. 

A recombinant host cell may comprise a nucleic acid segment encoding a 3-keto- 
acyl-CoA reductase protein, wherein the nucleic acid segment is selected from the group 
consisting of: a nucleic acid sequence at least about 80% identical to SEQ ID NO:8; a 
nucleic acid sequence that hybridizes under stringent conditions to SEQ ID NO: 8 or the 
complement thereof; a nucleic acid sequence encoding a protein at least about 80% 
identical to SEQ ID NO:9; and a nucleic acid sequence encoding a protein that is 
immunoreactive with an antibody prepared using SEQ ID NO: 9 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:9. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:8. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:9. The host cell may generally be any host cell, 
and preferably is a bacterial, fungal, mammalian, or plant cell. The bacterial cell is 
preferably an Escherichia coli, Bacillus, Pseudomonas, or Ralstonia eutropha cell. The 
fungal cell is preferably a Saccharomyces cerevisiae or Schizosaccharomyces pombe cell. 
The plant cell is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants 
such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa cell. 

A genetically transformed plant cell may comprise in the 5' to 3' direction: a) a 
promoter that directs transcription of a structural nucleic acid sequence encoding a 3- 
keto-acyl-CoA reductase protein; b) a structural nucleic acid sequence encoding a 3-keto- 
acyl-CoA reductase protein; wherein the structural nucleic acid sequence is selected from 
the group consisting of: a nucleic acid sequence at least about 80% identical to SEQ ID 
NO: 8; a nucleic acid sequence that hybridizes under stringent conditions to SEQ ID NO: 8 
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or the complement thereof; a nucleic acid sequence encoding a protein at least about 80% 
identical to SEQ ID NO:9; and a nucleic acid sequence encoding a protein that is 
immunoreactive with an antibody prepared using SEQ ID NO:9 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:9; c) a 3' transcription terminator; and 
d) a 3' polyadenylation signal sequence that directs the addition of polyadenylate 
nucleotides to the 3' end of RNA transcribed from the structural nucleic acid sequence. 
More preferably, the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 
92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:8. The nucleic 
acid segment may be obtained from a natural source, may be mutagenized, may be 
genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence preferably encodes a protein at least about 82%, 84%, 86%, 88%, 
90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:9. The plant 
may generally be any plant, and more preferably a monocot, dicot, or conifer. The plant 
is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants such as corn, 
soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, sugarcane, switchgrass, 
or alfalfa plant. 

A method of preparing host cells useful to produce a 3-keto-acyl-CoA reductase 
protein may comprise a) selecting a host cell; b) transforming the selected host cell with a 
recombinant vector having a structural nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein, wherein the structural nucleic acid sequence is selected from the group 
consisting of: a nucleic acid sequence at least about 80% identical to SEQ ID NO: 8; a 
nucleic acid sequence that hybridizes under stringent conditions to SEQ ID NO: 8 or the 
complement thereof; a nucleic acid sequence encoding a protein at least about 80%) 
identical to SEQ ID NO:9; and a nucleic acid sequence encoding a protein that is 
immunoreactive with an antibody prepared using SEQ ID NO:9 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:9; and c) obtaining transformed host 
cells. More preferably, the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 
90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:8. The 
nucleic acid segment may be obtained from a natural source, may be mutagenized, may 
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be genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence preferably encodes a protein at least about 82%, 84%, 86%, 88%, 
90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:9. The host 
cell may generally be any host cell, and preferably is a bacterial, fungal, mammalian, or 
plant cell. The bacterial cell is preferably an Escherichia coli, Bacillus, Pseudomonas, or 
Ralstonia eutropha cell. The fungal cell is preferably a Saccharomyces cerevisiae or 
Schizosaccharomyces pombe cell. The plant cell is preferably a tobacco, wheat, potato, 
Arabidopsis, and high oil seed plants such as corn, soybean, canola, oil seed rape, 
sugarbeet, sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa cell. 

A method of preparing plants useful to produce a 3-keto-acyl-CoA reductase 
protein may comprise a) selecting a host plant cell; b) transforming the selected host plant 
cell with a recombinant vector having a structural nucleic acid sequence encoding a 3- 
keto-acyl-CoA reductase protein, wherein the structural nucleic acid sequence is selected 
from the group consisting of: a nucleic acid sequence at least about 80% identical to SEQ 
ID NO: 8; a nucleic acid sequence that hybridizes under stringent conditions to SEQ ID 
NO: 8 or the complement thereof; a nucleic acid sequence encoding a protein at least 
about 80% identical to SEQ ID NO:9; and a nucleic acid sequence encoding a protein that 
is immunoreactive with an antibody prepared using SEQ ID NO:9 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:9; c) obtaining transformed host plant 
cells; and d) regenerating the transformed host plant cells. More preferably, the nucleic 
acid sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:8. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:9. The plant (and plant cell) may generally be 
any plant, and more preferably a monocot, dicot, or conifer. The plant is preferably a 
tobacco, wheat, potato, Arabidopsis, and high oil seed plants such as corn, soybean, 
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canola, oil seed rape, sugarbeet, sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa 
plant. 

The invention also relates to fusion proteins. A fusion protein may comprise a 
green fluorescent protein subunit; and a 3-keto-acyl-CoA reductase protein subunit; 
wherein the 3-keto-acyl-CoA reductase protein subunit comprises an amino acid 
sequence selected from the group consisting of: an amino acid sequence at least about 
80% identical to SEQ ID NO:9; and an amino acid sequence that is immunoreactive with 
an antibody prepared using SEQ ID NO:9 as an antigen, the antibody being 
immunoreactive with SEQ ID NO:9. The 3-keto-acyl-CoA reductase protein subunit is 
preferably at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, 
or 1 00% identical to SEQ ID NO:9 

A nucleic acid segment encoding a fusion protein may comprise a nucleic acid 
sequence encoding a green fluorescent protein subunit; and a nucleic acid sequence 
encoding a 3-keto-acyl-CoA reductase protein subunit; wherein the nucleic acid sequence 
encoding a 3-keto-acyl-CoA reductase protein subunit is selected from the group 
consisting of: a nucleic acid sequence at least about 80% identical to SEQ ID NO:8; a 
nucleic acid sequence that hybridizes under stringent conditions to SEQ ID NO: 8 or the 
complement thereof; a nucleic acid sequence encoding a protein at least about 80% 
identical to SEQ ID NO:9; and a nucleic acid sequence encoding a protein that is 
immunoreactive with an antibody prepared using SEQ ID NO:9 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:9. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:8. The nucleic acid sequence may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein subunit at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 
98%, 99%, 99.5%>, or 100% identical to SEQ ID NO:9. 
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phaC and PhaC 

A nucleic acid segment may comprise a nucleic acid sequence encoding a 
polyhydroxyalkanoate synthase protein, wherein the nucleic acid sequence is selected 
from the group consisting of: a nucleic acid sequence at least about 80% identical to SEQ 
ID NO: 10; a nucleic acid sequence that hybridizes under stringent conditions to SEQ ID 
NO: 10 or the complement thereof; a nucleic acid sequence encoding a protein at least 
about 80% identical to SEQ ID NO:ll; and a nucleic acid sequence encoding a protein 
that is immunoreactive with an antibody prepared using SEQ ID NO: 1 1 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:l 1. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO: 10. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:l 1. 

An isolated polyhydroxyalkanoate synthase protein may comprise an amino acid 
sequence selected from the group consisting of: an amino acid sequence at least about 
80% identical to SEQ ID NO:ll; and an amino acid sequence that is immunoreactive 
with an antibody prepared using SEQ ID NO: 11 as an antigen, the antibody being 
immunoreactive with SEQ ID NO:l 1. The protein is preferably at least about 82%, 84%, 
86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID 
NO: 11 

A recombinant vector may comprise in the 5' to 3' direction: a) a promoter that 
directs transcription of a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate synthase protein; b) a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate synthase protein; wherein the structural nucleic acid sequence is 
selected from the group consisting of: a nucleic acid sequence at least about 80% 
identical to SEQ ID NO: 10; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO: 10 or the complement thereof; a nucleic acid sequence 
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encoding a protein at least about 80% identical to SEQ ID NO:ll; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO: 1 1 as an antigen, the antibody being immunoreactive with SEQ ID NO:l 1 ; and c) 
a 3' transcription terminator. More preferably, the nucleic acid sequence is at least about 
82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to 
SEQ ID NO: 10. The nucleic acid segment may be obtained from a natural source, may 
be mutagenized, may be genetically engineered by mutagenesis or other methods, or may 
be synthetic. The nucleic acid sequence preferably encodes a protein at least about 82%, 
84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID 
NO:l 1. The promoter may generally be any promoter, and more preferably is a tissue 
selective or tissue specific promoter. The promoter may be constitutive or inducible. The 
promoter may be a viral promoter. The promoter may be a CMV35S, enhanced 
CMV35S, an FMV35S, a Lesquerella hydroxylase, or a 7S conglycinin promoter. 

A recombinant host cell may comprise a nucleic acid segment encoding a 
polyhydroxyalkanoate synthase protein, wherein the nucleic acid segment is selected 
from the group consisting of: a nucleic acid sequence at least about 80% identical to SEQ 
ID NO:10; a nucleic acid sequence that hybridizes under stringent conditions to SEQ ID 
NO: 10 or the complement thereof; a nucleic acid sequence encoding a protein at least 
about 80% identical to SEQ ID NO:ll; and a nucleic acid sequence encoding a protein 
that is immunoreactive with an antibody prepared using SEQ ID NO: 1 1 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:l 1. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO: 10. The nucleic acid segment may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO:ll. The host cell may generally be any host 
cell, and preferably is a bacterial, fungal, mammalian, or plant cell. The bacterial cell is 
preferably an Escherichia coli, Bacillus, Pseudomonas, or Ralstonia eutropha cell. The 
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fungal cell is preferably a Saccharomyces cerevisiae or Schizosaccharomyces pombe cell. 
The plant cell is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants 
such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa cell. 

A genetically transformed plant cell may comprise in the 5' to 3' direction: a) a 
promoter that directs transcription of a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate synthase protein; b) a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate synthase protein; wherein the structural nucleic acid sequence is 
selected from the group consisting of: a nucleic acid sequence at least about 80% 
identical to SEQ ID NO: 10; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO: 10 or the complement thereof; a nucleic acid sequence 
encoding a protein at least about 80% identical to SEQ ID NO:ll; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO: 1 1 as an antigen, the antibody being immunoreactive with SEQ ID NO:l 1; c) a 3' 
transcription terminator; and d) a 3' polyadenylation signal sequence that directs the 
addition of polyadenylate nucleotides to the 3' end of RNA transcribed from the 
structural nucleic acid sequence. More preferably, the nucleic acid sequence is at least 
about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical 
to SEQ ID NO: 10. The nucleic acid segment may be obtained from a natural source, may 
be mutagenized, may be genetically engineered by mutagenesis or other methods, or may 
be synthetic. The nucleic acid sequence preferably encodes a protein at least about 82%, 
84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID 
NO:l 1. The plant may generally be any plant, and more preferably a monocot, dicot, or 
conifer. The plant is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed 
plants such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa plant. 

A method of preparing host cells useful to produce a polyhydroxyalkanoate 
synthase protein may comprise a) selecting a host cell; b) transforming the selected host 
cell with a recombinant vector having a structural nucleic acid sequence encoding a 
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polyhydroxyalkanoate synthase protein, wherein the structural nucleic acid sequence is 
selected from the group consisting of: a nucleic acid sequence at least about 80% 
identical to SEQ ID NO: 10; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO: 10 or the complement thereof; a nucleic acid sequence 
encoding a protein at least about 80% identical to SEQ ID NO:ll; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO:l 1 as an antigen, the antibody being immunoreactive with SEQ ID NO:l 1; and c) 
obtaining transformed host cells. More preferably, the nucleic acid sequence is at least 
about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical 
to SEQ ID NO: 10. The nucleic acid segment may be obtained from a natural source, may 
be mutagenized, may be genetically engineered by mutagenesis or other methods, or may 
be synthetic. The nucleic acid sequence preferably encodes a protein at least about 82%, 
84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID 
NO:l 1. The host cell may generally be any host cell, and preferably is a bacterial, fungal, 
mammalian, or plant cell. The bacterial cell is preferably an Escherichia coli, Bacillus, 
Pseudomonas, or Ralstonia eutropha cell. The fungal cell is preferably a Saccharomyces 
cerevisiae or Schizosaccharomyces pombe cell. The plant cell is preferably a tobacco, 
wheat, potato, Arabidopsis, and high oil seed plants such as corn, soybean, canola, oil 
seed rape, sugarbeet, sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa cell. 

A method of preparing plants useful to produce a polyhydroxyalkanoate synthase 
protein may comprise a) selecting a host plant cell; b) transforming the selected host plant 
cell with a recombinant vector having a structural nucleic acid sequence encoding a 
polyhydroxyalkanoate synthase protein, wherein the structural nucleic acid sequence is 
selected from the group consisting of: a nucleic acid sequence at least about 80% 
identical to SEQ ID NO: 10; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO: 10 or the complement thereof; a nucleic acid sequence 
encoding a protein at least about 80% identical to SEQ ID NO:ll; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO: 11 as an antigen, the antibody being immunoreactive with SEQ ID NO:ll; c) 
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obtaining transformed host plant cells; and d) regenerating the transformed host plant 
cells. More preferably, the nucleic acid sequence is at least about 82%, 84%, 86%, 88%, 
90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO: 10. The 
nucleic acid segment may be obtained from a natural source, may be mutagenized, may 
be genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence preferably encodes a protein at least about 82%, 84%, 86%, 88%, 
90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:ll. The 
plant (and plant cell) may generally be any plant, and more preferably a monocot, dicot, 
or conifer. The plant is preferably a tobacco, wheat, potato, Arabidopsis, and high oil 
seed plants such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, 
peanut, sugarcane, switchgrass, or alfalfa plant. 

The invention also relates to fusion proteins. A fusion protein may comprise a 
green fluorescent protein subunit; and a polyhydroxyalkanoate synthase protein subunit; 
wherein the polyhydroxyalkanoate synthase protein subunit comprises an amino acid 
sequence selected from the group consisting of: an amino acid sequence at least about 
80% identical to SEQ ID NO: 11; and an amino acid sequence that is immunoreactive 
with an antibody prepared using SEQ ID NO: 11 as an antigen, the antibody being 
immunoreactive with SEQ ID NO:ll. The polyhydroxyalkanoate synthase protein 
subunit is preferably at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 
99%, 99.5%, or 100% identical to SEQ ID NO:l 1 

A nucleic acid segment encoding a fusion protein may comprise a nucleic acid 
sequence encoding a green fluorescent protein subunit; and a nucleic acid sequence 
encoding a polyhydroxyalkanoate synthase protein subunit; wherein the nucleic acid 
sequence encoding a polyhydroxyalkanoate synthase protein subunit is selected from the 
group consisting of: a nucleic acid sequence at least about 80% identical to SEQ ID 
NO: 10; a nucleic acid sequence that hybridizes under stringent conditions to SEQ ID 
NO: 10 or the complement thereof; a nucleic acid sequence encoding a protein at least 
about 80% identical to SEQ ID NO:ll; and a nucleic acid sequence encoding a protein 
that is immunoreactive with an antibody prepared using SEQ ID NO:l 1 as an antigen, the 
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antibody being immunoreactive with SEQ ID NO:ll. More preferably, the nucleic acid 
sequence is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 
99.5%, or 100% identical to SEQ ID NO: 10. The nucleic acid sequence may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence preferably 
encodes a protein subunit at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 
98%, 99%, 99.5%, or 1 00% identical to SEQ ID NO: 1 1 . 
PHA biosynthesis methods: vhaB and phaC 

A method for the preparation of polyhydroxyalkanoate may comprise: a) 
obtaining a cell comprising: a nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein; and a nucleic acid sequence encoding a PHA synthase protein; 
wherein: the nucleic acid sequence encoding a 3-keto-acyl-CoA reductase protein is not 
naturally found in the cell; the nucleic acid sequence encoding a PHA synthase protein is 
not naturally found in the cell; the nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein is selected from the group consisting of: a nucleic acid sequence at least 
about 80% identical to SEQ ID NO:8; a nucleic acid sequence that hybridizes under 
stringent conditions to SEQ ID NO: 8 or the complement thereof; a nucleic acid sequence 
encoding a protein at least about 80% identical to SEQ ID NO:9; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO:9 as an antigen, the antibody being immunoreactive with SEQ ID NO:9; and the 
nucleic acid sequence encoding a PHA synthase protein is selected from the group 
consisting of: a nucleic acid sequence at least about 80% identical to SEQ ID NO: 10; a 
nucleic acid sequence that hybridizes under stringent conditions to SEQ ID NO: 10 or the 
complement thereof; a nucleic acid sequence encoding a protein at least about 80% 
identical to SEQ ID NO:ll; and a nucleic acid sequence encoding a protein that is 
immunoreactive with an antibody prepared using SEQ ID NO: 11 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:ll; and b) culturing the cell under 
conditions suitable for the preparation of polyhydroxyalkanoate. The nucleic acid 
sequence encoding a 3-keto-acyl-CoA reductase protein more preferably is at least about 
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82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to 
SEQ ID NO: 8. The nucleic acid sequence encoding a 3-keto-acyl-CoA reductase protein 
may be obtained from a natural source, may be mutagenized, may be genetically 
engineered by mutagenesis or other methods, or may be synthetic. The nucleic acid 
sequence encoding a 3-keto-acyl-CoA reductase protein preferably encodes a protein at 
least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% 
identical to SEQ ID NO:9. The nucleic acid sequence encoding a PHA synthase protein 
more preferably is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 
99%, 99.5%, or 100% identical to SEQ ID NO: 10. The nucleic acid sequence encoding a 
PHA synthase protein may be obtained from a natural source, may be mutagenized, may 
be genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence encoding a PHA synthase protein preferably encodes a protein at 
•least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% 
identical to SEQ ID NO:ll. The cell may generally be any cell, and preferably is a 
bacterial, fungal, mammalian, or plant cell. The bacterial cell is preferably an 
Escherichia coli, Bacillus, Pseudomonas, or Ralstonia eutropha cell. The fungal cell is 
preferably a Saccharomyces cerevisiae or Schizosaccharomyces pombe cell. The plant 
cell is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants such as 
corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, sugarcane, 
switchgrass, or alfalfa cell. The polyhydroxyalkanoate may be a homopolymer or 
copolymer. The polyhydroxyalkanoate may be a polyhydroxybutyrate, 
polyhydroxyvalerate, polyhydroxyhexanoate, polyhydroxyoctanoate, 

polyhydroxydecanoate, or copolymers thereof. 

A method for the preparation of polyhydroxyalkanoate may comprise: a) 
obtaining a plant comprising: a nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein; and a nucleic acid sequence encoding a PHA synthase protein; 
wherein: the nucleic acid sequence encoding a 3-keto-acyl-CoA reductase protein is not 
naturally found in the plant; the nucleic acid sequence encoding a PHA synthase protein 
is not naturally found in the plant; the nucleic acid sequence encoding a 3-keto-acyl-CoA 
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reductase protein is selected from the group consisting of: a nucleic acid sequence at least 
about 80% identical to SEQ ID NO:8; a nucleic acid sequence that hybridizes under 
stringent conditions to SEQ ID NO: 8 or the complement thereof; a nucleic acid sequence 
encoding a protein at least about 80% identical to SEQ ID NO:9; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO:9 as an antigen, the antibody being immunoreactive with SEQ ID NO:9; and the 
nucleic acid sequence encoding a PHA synthase protein is selected from the group 
consisting of: a nucleic acid sequence at least about 80% identical to SEQ ID NO: 10; a 
nucleic acid sequence that hybridizes under stringent conditions to SEQ ID NO: 10 or the 
complement thereof; a nucleic acid sequence encoding a protein at least about 80% 
identical to SEQ ID NO: 11; and a nucleic acid sequence encoding a protein that is 
immunoreactive with an antibody prepared using SEQ ID NO: 11 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:ll; and b) growing the plant under 
conditions suitable for the preparation of polyhydroxyalkanoate. The nucleic acid 
sequence encoding a 3-keto-acyl-CoA reductase protein more preferably is at least about 
82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to 
SEQ ID NO:8. The nucleic acid sequence encoding a 3-keto-acyl-CoA reductase protein 
may be obtained from a natural source, may be mutagenized, may be genetically 
engineered by mutagenesis or other methods, or may be synthetic. The nucleic acid 
sequence encoding a 3-keto-acyl-CoA reductase protein preferably encodes a protein at 
least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% 
identical to SEQ ID NO:9. The nucleic acid sequence encoding a PHA synthase protein 
more preferably is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 
99%, 99.5%, or 100% identical to SEQ ID NO: 10. The nucleic acid sequence encoding a 
PHA synthase protein may be obtained from a natural source, may be mutagenized, may 
be genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence encoding a PHA synthase protein preferably encodes a protein at 
least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% 
identical to SEQ ID NO:ll. The plant is preferably a tobacco, wheat, potato, 
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Arabidopsis, and high oil seed plants such as corn, soybean, canola, oil seed rape, 
sugarbeet, sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa plant. The 
polyhydroxyalkanoate may be a homopolymer or copolymer. The polyhydroxyalkanoate 
may be a polyhydroxybutyrate, polyhydroxyvalerate, polyhydroxyhexanoate, 
polyhydroxyoctanoate, polyhydroxydecanoate, or copolymers thereof. 
PHA biosynthesis methods: vhaB 

A method for the preparation of polyhydroxyalkanoate may comprise: a) 
obtaining a cell comprising: a nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein; and a nucleic acid sequence encoding a PHA synthase protein; 
wherein: the nucleic acid sequence encoding a 3-keto-acyl-CoA reductase protein is not 
naturally found in the cell; the nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein is selected from the group consisting of: a nucleic acid sequence at least 
about 80% identical to SEQ ID NO:8; a nucleic acid sequence that hybridizes under 
stringent conditions to SEQ ID NO:8 or the complement thereof; a nucleic acid sequence 
encoding a protein at least about 80% identical to SEQ ID NO:9; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO:9 as an antigen, the antibody being immunoreactive with SEQ ID NO:9; and b) 
culturing the cell under conditions suitable for the preparation of polyhydroxyalkanoate. 
The nucleic acid sequence encoding a 3-keto-acyl-CoA reductase protein more preferably 
is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 
100% identical to SEQ ID NO:8. The nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein may be obtained from a natural source, may be mutagenized, may be 
genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence encoding a 3-keto-acyl-CoA reductase protein preferably encodes a 
protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 
100% identical to SEQ ID NO:9. The cell may generally be any cell, and preferably is a 
bacterial, fungal, mammalian, or plant cell. The bacterial cell is preferably an 
Escherichia coli, Bacillus, Pseudomonas, or Ralstonia eutropha cell. The fungal cell is 
preferably a Saccharomyces cerevisiae or Schizosaccharomyces pombe cell. The plant 
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cell is preferably a tobacco, wheat, potato, Arabidopsis, and high oil seed plants such as 
corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, peanut, sugarcane, 
switchgrass, or alfalfa cell. The polyhydroxyalkanoate may be a homopolymer or 
copolymer. The polyhydroxyalkanoate may be a polyhydroxybutyrate, 
polyhydroxyvalerate, polyhydroxyhexanoate, polyhydroxyoctanoate, 

polyhydroxydecanoate, or copolymers thereof. 

A method for the preparation of polyhydroxyalkanoate may comprise: a) 
obtaining a plant comprising: a nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein; and a nucleic acid sequence encoding a PHA synthase protein; ' 
wherein: the nucleic acid sequence encoding a 3-keto-acyl-CoA reductase protein is not 
naturally found in the plant; the nucleic acid sequence encoding -a 3-keto-acyl-CoA 
reductase protein is selected from the group consisting of: a nucleic acid sequence at least 
about 80% identical to SEQ ID NO:8; a nucleic acid sequence that hybridizes under 
stringent conditions to SEQ ID NO:8 or the complement thereof; a nucleic acid sequence 
encoding a protein at least about 80% identical to SEQ ID NO:9; and a nucleic acid 
sequence encoding a protein that is immunoreactive with an antibody prepared using SEQ 
ID NO:9 as an antigen, the antibody being immunoreactive with SEQ ID NO:9; and b) 
growing the plant under conditions suitable for the preparation of polyhydroxyalkanoate. 
The nucleic acid sequence encoding a 3-keto-acyl-CoA reductase protein more preferably 
is at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 
100% identical to SEQ ID NO:8. The nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein may be obtained from a natural source, may be mutagenized, may be 
genetically engineered by mutagenesis or other methods, or may be synthetic. The 
nucleic acid sequence encoding a 3-keto-acyl-CoA reductase protein preferably encodes a 
protein at least about 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 
100% identical to SEQ ID NO:9. The plant may generally be any plant, and preferably is 
a tobacco, wheat, potato, Arabidopsis, high oil seed plants such as corn, soybean, canola, 
oil seed rape, sugarbeet, sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa plant. 
The polyhydroxyalkanoate may be a homopolymer or copolymer. The 
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polyhydroxyalkanoate may be a polyhydroxybutyrate, polyhydroxyvalerate, 
polyhydroxyhexanoate, polyhydroxyoctanoate, polyhydroxydecanoate, or copolymers 
thereof. 

PHA biosynthesis methods: vhaC 

A method for the preparation of polyhydroxyalkanoate may comprise: a) 
obtaining a cell comprising: a nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein; and a nucleic acid sequence encoding a PHA synthase protein; 
wherein: the nucleic acid sequence encoding a PHA synthase protein is not naturally 
found in the cell; the nucleic acid sequence encoding a PHA synthase protein is selected 
from the group consisting of: a nucleic acid sequence at least about 80% identical to SEQ 
ID NO: 10; a nucleic acid sequence that hybridizes under stringent conditions to SEQ ID 
NO: 10 or the complement thereof; a nucleic acid sequence encoding a protein at least 
about 80% identical to SEQ ID NO:ll; and a nucleic acid sequence encoding a protein 
that is immunoreactive with an antibody prepared using SEQ ID NO:l 1 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:ll; and b) culturing the cell under 
conditions suitable for the preparation of polyhydroxyalkanoate. The nucleic acid 
sequence encoding a PHA synthase protein more preferably is at least about 82%, 84%, 
86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID 
NO: 10. The nucleic acid sequence encoding a PHA synthase protein may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence encoding 
a PHA synthase protein preferably encodes a protein at least about 82%, 84%, 86%, 88%, 
90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:ll. The cell 
may generally be any cell, and preferably is a bacterial, fungal, mammalian, or plant cell. 
The bacterial cell is preferably an Escherichia coli, Bacillus, Pseudomonas, or Ralstonia 
eutropha cell. The fungal cell is preferably a Saccharomyces cerevisiae or 
Schizosaccharomyces pombe cell. The plant cell is preferably a tobacco, wheat, potato, 
Arabidopsis, and high oil seed plants such as corn, soybean, canola, oil seed rape, 
sugarbeet, sunflower, flax, peanut, sugarcane, switchgrass, or alfalfa cell. The 
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polyhydroxyalkanoate may be a homopolymer or copolymer. The polyhydroxyalkanoate 
may be a polyhydroxybutyrate, polyhydroxyvalerate, polyhydroxyhexanoate, 
polyhydroxyoctanoate, polyhydroxydecanoate, or copolymers thereof. 

A method for the preparation of polyhydroxyalkanoate may comprise: a) 
obtaining a plant comprising: a nucleic acid sequence encoding a 3-keto-acyl-CoA 
reductase protein; and a nucleic acid sequence encoding a PHA synthase protein; 
wherein: the nucleic acid sequence encoding a PHA synthase protein is not naturally 
found in the plant; the nucleic acid sequence encoding a PHA synthase protein is selected 
from the group consisting of: a nucleic acid sequence at least about 80% identical to SEQ 
ID NO: 10; a nucleic acid sequence that hybridizes under stringent conditions to SEQ ID 
NO: 10 or the complement thereof; a nucleic acid sequence encoding a protein at least 
about 80% identical to SEQ ID NO:ll; and a nucleic acid sequence encoding a protein 
that is immunoreactive with an antibody prepared using SEQ ID NO:l 1 as an antigen, the 
antibody being immunoreactive with SEQ ID NO:ll; and b) growing the plant under 
conditions suitable for the preparation of polyhydroxyalkanoate. The nucleic acid 
sequence encoding a PHA synthase protein more preferably is at least about 82%, 84%, 
86%, 88%, 90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID 
NO: 10. The nucleic acid sequence encoding a PHA synthase protein may be obtained 
from a natural source, may be mutagenized, may be genetically engineered by 
mutagenesis or other methods, or may be synthetic. The nucleic acid sequence encoding 
a PHA synthase protein preferably encodes a protein at least about 82%, 84%, 86%, 88%, 
90%, 92%, 94%, 96%, 98%, 99%, 99.5%, or 100% identical to SEQ ID NO:ll. The 
plant may generally be any plant, and preferably is a tobacco, wheat, potato, Arabidopsis, 
high oil seed plants such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, 
flax, peanut, sugarcane, switchgrass, or alfalfa plant. The polyhydroxyalkanoate may be 
a homopolymer or copolymer. The polyhydroxyalkanoate may be a 
polyhydroxybutyrate, polyhydroxyvalerate, polyhydroxyhexanoate, 

polyhydroxyoctanoate, polyhydroxydecanoate, or copolymers thereof. 
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Methods for preparing higher polyhvdroxyalkanoates 

Polyhydroxyalkanoate may be prepared by a method comprising: a) obtaining a 
recombinant host cell comprising: a nucleic acid sequence encoding a p-ketothiolase 
protein; a nucleic acid sequence encoding a 3-ketoacyl-CoA reductase protein; a nucleic 
acid sequence encoding a polyhydroxyalkanoate synthase protein; a nucleic acid 
sequence encoding a P-hydroxyacyl-CoA dehydrase; and a nucleic acid sequence 
encoding an acyl-CoA dehydrogenase protein or an enoyl-CoA reductase protein; and b) 
culturing the recombinant host cell under conditions suitable for the preparation of 
polyhydroxyalkanoate; wherein: the polyhydroxyalkanoate comprises C6, C8, or CIO 
monomer subunits; the nucleic acid sequence encoding a 3-keto-acyl-CoA reductase 
protein is selected from the group consisting of: a nucleic acid sequence at least about 
80% identical to SEQ ID NO:8; a nucleic acid sequence that hybridizes under stringent 
conditions to SEQ ID NO: 8 or the complement thereof; a nucleic acid sequence encoding 
a protein at least about 80% identical to SEQ ID NO:9; and a nucleic acid sequence 
encoding a protein that is immunoreactive with an antibody prepared using SEQ ID NO:9 
as an antigen, the antibody being immunoreactive with SEQ ID NO:9. 

Primers, probes, and antibodies 

The sequences disclosed in the sequence listing may also be used to prepare 
primers, probes, and monoclonal or polyclonal antibodies. 

SEQ ID NOS:l, 2, 4, 6, 8, 10, 22, 24, 26, and 28, and the their complementary 
strands may be used to design oligonucleotide primers and probes. Primers and probes 
are typically at least 15 nucleotides in length, and more preferably are at least 20, 22, 24, 
26, 28, 30, 40, or 50 nucleotides in length. Contiguous nucleotide sequences from a 
given sequence are chosen based upon favorable hybridization conditions, including 
minimization of hairpin or other detrimental sequences. The identification of suitable 
primer or probe sequences is well known to those of skill in the art, and is facilitated by 
commercially available software such as Mac Vector (Oxford Molecular Group) and 
Xprimer (http://alces.med.umn.edu/rawprimer.html). Primers and probes may be used for 
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the screening of libraries, for PCR amplification, and other routine molecular biological 
applications. Primers and probes may also be used for antisense applications. 

SEQ ID NOS:3, 5, 7, 9, 1 1, 23, 25, 27, and 29 may be used for the generation of 
monoclonal or polyclonal antibodies. The entire sequences may be used, or antigenic 
fragments thereof. Alternatively, portions of the full length sequences may be 
synthesized and covalently attached to antigenic proteins such as keyhole limpet 
hemocyanin (KLH). Portions of the full length sequences may be used for the 
preparation of multi-antigenic peptides (52). The generation of monoclonal and 
polyclonal antibodies is well known to those of skill in the art. 

The following Examples are included to demonstrate preferred embodiments of 
the invention. It should be appreciated by those of skill in the art that the techniques 
disclosed in the examples which follow represent techniques discovered by the inventors 
to function well in the practice of the invention, and thus can be considered to constitute 
preferred modes for its practice. However, those of skill in the art should, in light of the 
present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without 
departing from the spirit and scope of the invention. 
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EXAMPLES 
Example 1 : Bacterial strains and plasmids 



Table 1. Strains 



Strains 


Relevant characteristics' 1 


Source or 
Reference 


E. coli DH5ct 


deoR endAl gyrA96 hsdR17 (r{ m k + ) recAl relAl supE44 thi-1 A:(lacZYA- 
argFV169) ^OlacZAM 1 5 PA,". Cloning host and for expression of pha 
genes 


^lunieLii 


B. megaterium 
11561 


Wild type, used to clone pha genes 


ATCC b 


B. megaterium 
PHA05 


phaP, -Q, -R, -B and -C deletion derivative of B. megaterium 1 1561 


This 
Application 


P. oleovorans 
29347 


PHA positive control 


ATCC 


P. putida 
GPpl04 


PHA negative mutant obtained by chemical mutagenesis 


(22) \ 
1 


Table 2. Plasmids 




Plasmids 


Relevant characteristics' 1 


Source or 
Reference 


pBluescriptllSK 


Cloning vector, ColEl oriV c , Amp r 


Stratagene 


pGFPuv 


Source of gfp gene, ColEl oriV, Amp r 


Clontech 


pHPS9 


Bacillus-Escherichia coli shuttle vector, ColEl and pTA1060 onV, Em , Cm 


(16) 


pSUP104 


Pseudomonas-Escherichia coli shuttle vector, Q-type and mini 15 oriV, Em , 
Tc r 


(40) 


pGMl 


EcoRl in phaP to Hindlll in phaC, cloned into the EcoRhHindlU sites of 
pBluescriptllSK, Amp r 


This 
application 


pGM6 


Pst\ in phaB to EcoRl mykrM, cloned into the Pstl-EcoRl sites of 
pBluescriptllSK, Amp r 


This 
application 


pGM7 


EcoRl mphaP to EcoRl mykrM, cloned into the EcoRl site of pBluescriptllSK, 
Amp r 


This 
application 


pGM9 


Hindlll upstream of ykoY to Pst\ in phaB, cloned into the Hindlll -Pst\ sites of 
pBluescriptllSK, Amp r 


This 
application 


pGMlO 


Hindlll upstream of ykoY to EcoRl mykrM, cloned into the Hindlll -EcoRl 
sites of pBluescriptllSK, Amp r 


This 
application 


pGM7H 


EcoRl in phaP to EcoRl mykrM, cloned into the EcoRl site of pHPS9, Cm 


This 
application 


pC/GFP2 
| pC/GFP3 


PhaC::GFP out-of-frame fusion plasmid. 

Fragment shown in Figure 4A cloned in pBluescriptllSK, Amp r 

PhaC::GFP in-frame fusion plasmid. 


This 
application 
This 
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pGM13 



pGM13C 



PhaC::GFP in-frame fusion plasmid. 
Fragment shown in Figure 4C cloned in pHPS9, Em r Lm r 



pP/GFP3 



pGM16.2 



pGM107 



pDRl 



pGM61 



pGM73 



GFP localization control plasmid. Part of phaB mdphaC deleted. 
Fragment shown in Figure 4D cloned in pHPS9, Em r Lm r 



PhaP::GFP in-frame fusion plasmid. 
Fragment shown in Figure 4E cloned in pBluescriptHSK, Amp r 



PhaP::GFP in-frame fusion plasmid. 
Fragment shown in Figure 4F cloned in pHPS9, Em r Lm r 



This 
application 



This 
application 



This 
application 



EcoRl in phaP to EcoR\ vxykrM, cloned as a BamHl-Sall fragment from 
pGM7, into the BamHl and Sail sites of pSUP104, Cm r 



Pst\ mphaB to EcoRl mykrM, cloned as a SmaVEcoRN fragment from pGM6 
into the two Dra\ sites of pSUP104 in same orientation as the Cm gene, with 
phaC expressed from the Cm promoter, Tc r 



Derived from pGM13. It carries an in-frame 594 bp deletion in phaR, 

extending from 96 bp upstream of the phaR initiation codon through codon 
144. 



Derived from pGM61 . Carries a transcriptional fusion between the promoter of 
phaP and the coding region plus translation signals of phaR. A 663 bp DNA 
fragment harboring phaR was cloned into the SnaBl site in phaP in the sense 
orientation 



This 
application 



This 
application 



This 
application 



This 
Application 



This 
Application 



°Em' erythromycin resistant; Lm', lincomycin resistant; Cm', chloramphenicol resistant; Amp', ampicillin 
resisiant; Tc', Tetracycline resistant. "ATCC, American Type Culture Collection. 'Ong.n of replication. 

Example 2: Media and prnwth conditions 

Cultures were grown at 37°C (unless otherwise stated) in liquid media, aerated by 
rotation at 250 rpm in either Luria-Bertani (LB) broth (33) or M9 Minimal Salts (Life 
Technologies, Bethesda, MD) with 1% (w/v) glucose. For growth on plates, the above 
media with 1.5% agar (Sigma, A4550) was used. For plasmid selections, the appropriate 
antibiotics were included in the media: ampicillin (200 ug/mL [AMP 200 ]), 
chloramphenicol (25 ug/mL [CM 25 ]), erythromycin (200 ug/mL [EM 200 ]), or tetracycline 
(12.5 ug/mL [TC 125 ]) for plasmid selection in Escherichia coli; chloramphenicol (12 
ug/mL [CM 12 ]), or erythromycin (1 ug/mL [EM']) plus lincomycin (25 ^ig/mL [LM 25 ]) 
for plasmid selection in Bacillus megaterium; chloramphenicol (160 ug/mL [CM 160 ]), or 
tetracycline (30 ug/mL [TC 30 ]) for selection in Pseudomonas. 

Example 3: Transformations 

Escherichia coli and Pseudomonas putida were transformed by electroporation of 
competent cells using an electroporator (Eppendorf) and following the manufacturers 
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instructions. Bacillus megaterium was transformed using a biolistic transformation 
procedure (39). 



Example 4: Microscopy 

For phase contrast microscopy, wet mounts of cultures were visualized at xl,000 
magnification in a light microscope with phase contrast attachments (Labophot-2 
Microscope, Nikon, Inc.). To view PHA inclusion-bodies, samples were heat fixed, 
stained with 1% (w/v) Nile Blue A (Sigma) for 15 minutes at 55°C, destained for 30 
seconds in 8% (v/v) acetic acid, water washed, air dried, and viewed at xlOOO 
magnification under fluorescence using filters; excitation, 446/10 nm; barrier filter, 590 
nm; dichroic mirror, 580 nm. To view GFP, wet mounts of cultures with or without 1% 
(w/v) agarose were viewed at xlOOO magnification under fluorescence using filters; 
excitation, 390-450 nm; barrier filter, 480-520 nm; dichroic mirror, 470 nm. 

Example 5: Codon usat^e in Bacillus megaterium 

Bacillus megaterium uses three codons as start codons in protein coding 
sequences. ATG, TTG, and GTG all encode methionine when present at the start of a 
coding region. TTG and GTG encode leucine and valine when present within a coding 
region, respectively. Bacillus megaterium uses TGA, TAA, and TAG as stop codons. 

Bacillus megaterium sequences starting with TTG or GTG may require 
mutagenesis to ATG if the sequences are to be expressed in organisms that use ATG 
exclusively as a start codon. 

Exam ple 6: Separation of polypeptides asso ciated with PHA inclusion-bodies. 

In an attempt to determine their relevance, proteins that co-purify with PHA 
inclusion-bodies were separated by electrophoreses on an SDS-polyacrylamide gel 
(Figure 1). 

Inclusion-bodies were purified (32) followed by suspension in TE buffer (10 mM 
Tris-HCl pH 8, 1 mM EDTA) with 2% (w/v) SDS. An equal volume of 2x sample buffer 
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(100 mM Tris-HCl (pH 6.8), 4% SDS, 4 mM EDTA, 20% glycerol, 2% 2- 
mercaptoethanol, 0.1% bromophenol blue) was added prior to boiling for 5 minutes and 
samples were centrifuged for 3 minutes to pellet PHA; the supernatant was loaded on a 
12% SDS-polyacrylamide gel and run at 8 mA overnight at 4°C to separate proteins. The 
gel was stained with Coomassie Blue for 5 minutes prior to transfer of proteins to a 
polyvinylidene difluoride membrane using a semi-dry electroblotter at 400 mA for 45 
minutes. 

There were at least thirteen such proteins present in various quantities. Some or 
. all of these proteins could be intrinsic structural components of PHA inclusion-bodies, 
enzymes involved with PHA metabolism or possibly scaffolding components involved in 
inclusion-body assembly. Alternatively, they could have been acquired by the inclusion- 
bodies during the purification procedure. The three most abundant proteins had 
molecular weights of approximately 14, 20 and 41 kDa. 

The N-terminal amino acid sequence for the three most prevalent proteins were 
determined. Membrane carrying the proteins of interest was cut for use in N-terminal 
amino acid sequence determination by Edman Degradation using a minimum quantity of 
200 pmols of each protein. The N-terminal amino acid sequence of the 14 kDa protein 
was KVFGRXELAAAMKRXGL (SEQ ID NO:19), the 20 kDa protein was 
NTVKYXTVIXAMXXQ (SEQ ID NO:20), and the 41 kDa proteins was 
AIPYVQEXEKL (SEQ ID NO:21). A BLASTp search ((1), performed with NCBI 
Entrez database; http://www.ncbi.nlm.nih.gov/Entrez/) revealed that the 14 kDa protein 
was lysozyme and the other two N-terminal sequences were novel. It was concluded that 
the lysozyme used in the cell lysis procedure had co-purified with the PHA inclusion 
bodies. This result confirms that not necessarily all of the proteins that co-purify with 
PHA inclusion-bodies are associated with them in vivo, as was also shown for 
Chromatium vinosum (27). 
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Example 7: Cloning the vha region 



Purification of genomic and plasmid DNA, Southern blot, hybridization and 
cloning were by standard procedures (38). To clone the DNA sequences that coded for 
the two most abundant proteins on purified PHA inclusion-bodies, degenerate 
oligonucleotide probes based on their N-terminal amino acid sequences were used. The 
probes were: AAYACRGTNAAATAYNNNACRGTNATYNNNGCDATGATG (n2, 
SEQ ID NO: 12) and GCDATYCCDTAYGTNCARGAAGGHTTYAAA (n5, SEQ ID 
NO:13) for the 20 kDa and 41 kDa proteins, respectively (Figure 1). 

Both probes, used in separate 38°C Southern blotting hybridization experiments, 
identified a 6.4 kb Hindlll, a 5.2 kb EcoRI, and a 3.7 kb Hindlll to EcoRI DNA fragment 
of DNA, indicating that the 5' ends of the coding regions for both of these proteins were 
located less than 3.7 kb apart in the genome. The three fragments were purified from 
agarose following electrophoresis, and cloned into plasmid pBluescriptllSK. 

Positive clones were identified by hybridization to the same degenerate probes, 
thus yielding plasmid pGMl containing the 3.7 kb fragment. Sequences contiguous with 
and overlapping this primary cloned fragment were cloned in a similar manner except 
that probes based on the ends of the sequenced DNA fragment were used, and 
hybridization was performed at 55°C. The probes used were 
GCTTCATGCGTGCGGTTTG (bmp, SEQ ID NO: 14) and 
GGACCGTTCGGAAAATCAGCGG (bmc, SEQ ID NO: 15), yielding respectively, 
pGM9 and pGM6 (Figure 2). 

DNA fragments of pGMl, pGM6 and pGM9 were subcloned into 
pBluescriptllSK, and sequenced, from both ends using universal primers and internally 
by primer walking on both strands, using dye terminator chemistry, cycle sequencing and 
an ABI Prism 377 sequencer (Applied Biosystems). Sequence assembly and analysis was 
performed using Lasergene (DNAStar, Inc.), and Gapped BLAST and PSI BLAST (1). 

The 3.7 kb fragment contained 5 ORFs (Figure 2), whose predicted amino acid 
sequences encode PhaP (20 kDa protein), PhaQ, PhaR, PhaB and PhaC (41 kDa protein). 
The 20 and 41 -kDa proteins were identified by their N-terminal amino acid sequences. 
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Since the C-terminus for each of these two proteins extended beyond the boundaries of 
pGMl, the remaining sequence were obtained from plasmids pGM6 and pGM9. 

Example 8: The pha locus. 

The 7,916 bp region (SEQ ID NO:l) containing pha genes from Bacillus 
megaterium was cloned, sequenced and characterized. It was shown to carry 8 complete 
and 1 incomplete open reading frame (Figure 2, Tables 3 and 4). Coding sequences in 
this region were assigned on the basis of homology to known sequences, N-terminal 
amino acid sequences, putative ribosome binding sites and operon location. The 
complement and arrangement of genes flanking the pha genes in Bacillus megaterium are 
very similar to a region of Bacillus subtilis 168 (Figure 2). This strain is negative for 
PHA and no known pha genes or sequences occur in its genome, for which the complete 
sequence is available (24). In place of pha genes in this region of B. subtilis are ykrl, 
ykrK and ykrl, which, respectively, code for putative proteins similar to two unknown 
proteins, and a probable heat shock protein. 



Table 3: Sequence analysis results 



Sequence 


Number of amino acids 


Mol mass Daltons 


Isoelectric point 


ykoY 


271 


29,996 


6.89 


ykoZ 


236 


27,662 


9.36 


sspD 


65 


7,027 


8.58 


phaP 


170 


19,906 


5.29 


phaQ 


146 


16,686 


5.09 


phaR 


168 


19,150 


5.10 


phaB 


247 


26,098 


7.39 


phaC 


362 


41,463 


8.31 


ykrM 


318 a 


ND 


ND 



'Partial protein. 
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Table 4: Sequence homologies 



Sequence 


Homologies IU KJ1UWI1 aliu 
nutntivp oprips ^accession no ^ a 


THpntitv 


Similarity 

L_J 1.111 1.1141 


Function or putative function 


yKoi 


I K.U I , LJ . oUUllllo yiL^yy i ivy 


64% 


73% 


Toxic anion resistance protein (24) 


ykoZ 


YkoZ, B. subtilis (Z99111) 


57% 


74% 


RNA polymerase sigma factor (24) 


sspD 


SspD, Bacillus megaterium 
SspD, B. subtilis (P04833) 


100% 
73% 


87% 


Spore specific, DNA binding 
protein (4, 10) 


phaP 


None 






PHA inclusion-body structure, 
shape and size (49) 


phaQ 


None 






Unknown 


phaR 


None 






Unknown 


phaB 


FabG, Synechocystis (D90907) 
PhaB, C. vinosum D (P45375) 
FabG, B. subtilis (P5\%3\) 


50% 
48% 
47% 


66% 
64% 
67% 


Fatty acid biosynthesis (23) 
3-ketoacyl-CoA reductase (28) 
Fatty acid biosynthesis (35) 


phaC 


PhaC, T. violacea (P45366) 
PhaC, Synechocystis (D90906) 
PhaC, C vinosum (P45370) 


38% 
37% 
35% 


59% 
56% 
55% 


PHA synthase (29, 23, 28) 


ykrM 


YkrM, B. subtilis (Z99\\\) 


55% 


71% 


Na + -transporting ATP synthase 
(24) 



'Accession numbers are SWISS-PROT, EMBL or DDBJ; b None, No discernible similarity to known 



sequences. 

Example 9: The vha nucleic acid and encoded protein sequences 

The deduced amino acid sequence of PhaP shows a 20 kDa extremely hydrophilic 
product with no obvious similarity to known sequences (Figure 6). Inclusion-body 
associated low molecular weight proteins (phasins) have been described in many bacteria 
(49), but where sequences were available no similarities of identifiable significance with 
PhaP of Bacillus megaterium were found. 

Low molecular weight, PHA inclusion-body abundant proteins play an important 
role in PHA producing cells, since they are involved in determining inclusion-body size 
and shape, and are present in quantities up to 5% of total protein in the case of PHA 
producing A. eutrophus (48). It is an interesting observation that the amino acid 
sequences of phasin proteins are so dissimilar, even in closely related bacteria. Some 
similarity between such proteins would be expected in closely related bacteria, were they 
to have a role in inclusion-body biogenesis, however, conservation of sequence would be 
entirely unnecessary should they have a role as storage proteins. 
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The deduced amino acid sequences of PhaQ and PhaR also revealed small 
hydrophilic proteins with no significant identifiable similarity to known proteins (Figures 
7 and 8). Figure 1 (lane 2) shows that purified inclusion-bodies have proteins represented 
by bands of the approximate sizes of PhaQ (17 kDa) and PhaR (19 kDa), but the roles of 
these proteins are unknown. They may be non-orthologous replacements for the small 
putative gene products, whose roles are also unknown, coded in known pha gene clusters. 
The deduced amino acid sequence of PhaB, is similar in size and amino acid sequence to 
known phaB and fabG gene products (Table 2). The deduced amino acid sequence of 
PhaC shows that while it has low homology overall to known PhaC proteins, it is most 
similar to that of T. violacea, Synechocystis and C. vinosum. PhaC proteins from these 
three bacterial strains, respectively, have 355, 378, and 355 amino acids while PhaC from 
Bacillus megaterium has 362 amino acids. All other PhaC proteins studied are larger in 
size, and range from 559 amino acids for that of P. oleovorans (22) to 636 amino acids 
for that of Rhizobium etli (3). Alignment studies of sequences of all previously known 
PhaC proteins show that the synthases are either large single subunit enzymes (PhaC) or 
smaller two subunit enzymes (PhaC and PhaE). The Bacillus megaterium PhaC protein 
aligns poorly with large, single subunit enzymes such as the P. oleovorans PhaC (Figure 
3). 

Example 10: Functionality of the nha gene cluster 

It has been demonstrated that the phaP, -Q, -R, -B and -C gene cluster can 
complement a deletion mutant of B. megaterium. This mutant PHA05 was constructed 
by a gene substitution technique. A plasmid (based on pGMlO) in which the pha genes 
were substituted by the erythromycin gene, was propagated in B. megaterium 11561. 
Selection on erythromycin allowed isolation of the PHA05 mutant that was negative for 
PHA synthesis. Complementation with the phaP, -Q, -R, -B and -C gene cluster was 
obtained when pGM7H or pGM13 was introduced into the PHA05 strain. 

Experiments introducing a phaR deletion of pGM13 (pGM61) into PHA05 
suggests that the presence of phaR may be preferred for PHA synthesis. This result was 
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confirmed by the recloning of phaR into pGM61 (pGM73) as it was isolated from 
PHA05(pGM61) strain, followed by the introduction of pGM73 into PHA05. 
Accumulation of PHA in PHA05(pGM73) confirmed the preference for phaR. It has 
been previously demonstrated that the small type PhaCs (see Example 17) is not 
sufficient for PHA synthesis; another peptide, PhaE of approximate size 30 kDa, is also 
required (51). These complementation studies suggest that it is preferable to combine 
PhaC of B. megaterium (also a small type PhaC) and phaR (19 kDa), however there is no 
sequence similarity between phaR ofB. megaterium and phaE of other organisms. 

Example 1 1 : Mapping transcription starts 

The transcription start points were mapped in the region from the EcoRI 
restriction site in phaP to the Hindlll site in ykrM by primer extension analysis, using the 
Promega system for primer extension on RNA templates. DNA oligonucleotide primers, 
17 to 20 nucleotides in length, were synthesized to match target sequences, initially at 
approximately 500 base pair intervals and subsequently at about 50 to 250 nucleotides 
down-stream from the predicted transcription start points. The 32 P 5' end-labeled primers 
were extended with reverse transcriptase using total RNA (10 [ig per reaction) purified 
from Bacillus megaterium (31). The fragment length initially, and transcription start 
nucleotides subsequently, were determined by running the cDNA on a 8% denaturing 
polyacrylamide gel along-side the products of sequencing reactions, which were 
generated using the same 5 '-end labeled primers. The primers used to identify the 
transcription start nucleotides for the phaP, phaQ, and phaRBC promoters were, 
respectively, CCCCTTTGTCCATTGTTCCC (SEQ ID NO: 16); 
CCATGTAGATTCCACCCTC (SEQ ID NO:17); and CTCCATCTCCTTTCTTGTG 
(SEQ ID NO: 18). 

Primer extension products showed a single band from each reaction, indicating 
one transcript, while control reactions in which RNA was omitted showed no bands. The 
extension products run alongside sequencing reaction products obtained with the same 
primer (Figure 2C), identified the 5' ends of the transcripts thus allowing the putative 
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promoter sequences at approximately -10 and -35 -bp for phaP, -Q and -R to be 
identified. The arrangement of genes in the pha cluster of Bacillus megaterium is unique 
among those already published and phaA is notably absent. ThephaP, -Q, -R, -B and -C 
genes were shown to be in a 4,104 -bp region, with phaP and -Q transcribed in one 
orientation, each from a separate promoter, while phaR, -B and -C were divergently 
transcribed from a promoter in front of phaR. The putative promoters responsible for 
transcription of phaQ and phaR, phaB and phaC show strong similarity to both Bacillus 
subtilis Sigma A type (34) and Escherichia coli, Sigma 70 type promoters (14), which 
can express constitutively. This is in keeping with previous data for Alcaligenes 
eutrophus showing that phaC is constitutively synthesized, but PHA is not constitutively 
accumulated (19). The third putative promoter in this region, the phaP promoter, 
resembles a Sigma D (SigD) type promoter known to control the expression of a regulon 
of genes associated with flagellar assembly, chemotaxis and motility (13, 20, 46). In 
Bacillus subtilis Sigma D is expressed in the exponential phase and peaks in late 
exponential phase of growth. This parallels the pattern of PHA accumulation previously 
described for Bacillus megaterium 11561 (32). However, further experiments are 
required to test the hypothesis that PHA accumulation in regulated by sigma D or 
products of its resulting transcripts. The phaP gene has 18 -bp duplicate sequences that 
could base-pair to form a rfco-independent terminator close to its translational stop codon 
(Figure 2B). The fact that the -35 promoter region of sspD is within this putative hairpin 
structure, suggests that transcription of phaP and sspD could be mutually exclusive, thus 
allowing the expression oiphaP to play a regulatory role in the expression of sspD (spore 
specific storage protein). 

Example 12: Expression of Bacillus megaterium pha g enes in Escherichia coli and 
Pseudomonas vutida 

Functionality of the Bacillus megaterium putative pha gene cluster was tested in 
Escherichia coli, which is naturally PHA negative, and Pseudomonas putida GPpl04, a 
phaC" mutant. Plasmids carrying one or more of these genes were introduced and the 
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resulting transformants were tested for PHA accumulation following growth on LB or 
M9 medium with various carbon sources and the appropriate antibiotic for plasmid 
selection. 

Triplicate 500 mL cultures, were grown in 2 liter flasks at 30°C, rotating at 250, 
using 1% inocula of 16 hour cultures, which had been grown in LB, centrifuged and 
resuspended in equal volumes of 0.9% saline. At 48 hours samples were removed for 
microscopy and cells were harvested, washed once in dH 2 0 and lyophilized. For PHA 
extraction, lyophilized cells were suspended in 10 volumes of 5% (w/v) bleach, shaken at 
65°C for 1 hour and centrifuged. The pellet was resuspended in 10 volumes of 5% bleach 
and centrifuged followed by sequentially washing in water and 95% ethanol. The amount 
of PHA is expressed as percent PHA per mass of vacuum dried cells (w/w). 

Escherichia coli carrying pGM7 or pGMlO accumulated low levels of PHA while 
Escherichia coli carrying pGMl or pGM6 accumulated no PHA. Fluorescence 
microscopy of Nile Blue A stained cells showed approximately 1 cell in 20 had one or 
several inclusion-bodies and the quantity of PHA produced was approximately 5% of cell 
dry weight. Since Escherichia coli does not have PhaA, a low level or no PHA is the 
expected result. However, in Pseudomonas where PhaA is not known to be required, 
Pseudomonas putida GPpl04 (pGM107) accumulated PHA on rich as well as minimal 
medium with various carbon sources to >50% of cell dry weight, and 90 to 100% of cells 
appeared full of PHA (Table 5). The positive control P. oleovorans, (equivalent to wild- 
type Pseudomonas putida) accumulated PHA only when grown on longer chain carbon 
sources, and not on LB. No PHA was accumulated by the negative control or by 
Pseudomonas putida carrying phaC alone (pDRl). These results showed that this 
Bacillus megaterium gene cluster is functional in both Escherichia coli and Pseudomonas 
putida. It is not known if the negative results obtained with pDRl was due to PhaC alone 
being insufficient to complement PhaC" Pseudomonas putida or to synthesize PHA in 
Escherichia coli, or if the expression of phaC on pDRl was not successful in producing 
protein. 
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Table 5: Cells with PHA as a percent 1 of total cells following growth on different carbon 

sources 



Substrates 
(no. C atoms) 


Source of 
genes: 


Positive 
control: 


Negative 
control, 
vector only: 


phaP"QRBC: 


phaC: 




Bacillus 
megaterium 


P. oleovorans 


Pseudomonas 
putida G?y\M 
(pSUP104) 


Pseudomonas 
putida GPpl04 
(pGM107) 


Pseudomonas 
putida GPp\04 
(pDRl) 


LB 


100 


0 


0 


90 


0 


LB/Glucose, 1% 


100 


0 


0 


92 


0 


M9/Caproate, 
12 mM (C6) 


no growth 


88 


0 


100* 


0 


M9/Octanoate, 
12 mM (C8) 


no growth 


90 


0 


92 . 


0 



1 100%, PHA in all cells; 0%, no PHA in any cell; data averaged from >5 fields of each of 3 different 
cultures, error less than 5%. n N-terminus only present. * Cell shape distorted by large quantity of PHA. 



These results suggest that the B. megaterium gene cluster, phaP, -£>, -R, -B, and 
-C, is functional in both E. coli and P. putida in so far as accumulation of PHA polymer. 
It is not known if the negative results obtained with pDRl were due to PhaC alone being 
insufficient to complement the PhaC mutant of P. putida or to synthesize PHA in E. coli. 

Example 13: Localization of PhaP and PhaC proteins 

Proteins associated with purified PHA inclusion-bodies may not accurately reflect 
the localization of the these proteins within the growing cell. Visualization oiphar.gfp 
gene product fusion proteins in living cells throughout culture growth is a useful method 
for determining both the localization of the pha gene products and their comparative 
levels in growing cells. PhaP and PhaC, as fusion proteins (Figure 4), localized to PHA 
inclusion-bodies at all time points tested throughout growth of Bacillus megaterium 
11561. The negative control (pHPS9) showed no fluorescence at any time point. The 
localization control (pGMBC) showed non-localized green fluorescence at all time 
points. The profiles of PHA accumulation in these two control strains were similar to 
that of the wild-type, where the quantity of PHA decreased during the lag phase, 
increased during exponential phase, and continued to increase at a lower steady state rate 
in stationary phase growth (32). 
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At time 0, cultures of Bacillus megaterium carrying, pGM16.2, pGM13, pGM13C 
or pHPS9 5 grown in LB with LM 25 EM 1 for 24 hours at 35°C ? were inoculated (5% v/v) 
into 75 mL of fresh media of the same composition, in 300 mL Naphelco flasks, and 
growth was continued at 27°C, 250 rpm. Optical densities of cultures were monitored 
and samples were removed for microscopy at time points starting at time 0, for up to 24 
hours. One part of each sample was immediately observed for green fluorescence by 
embedding in 1% low melting point agarose for viewing in phase contrast and under 
fluorescence for GFP, magnification xlOOO. Another part of each sample was stained for 
PHA and viewed under light microscopy and by fluorescence for PHA inclusion bodies, 
magnification xlOOO. Images were recorded using identical parameters for all samples to 
allow comparison of fluorescence and light intensities (f-stop, 1/15; brightness, 0.6; 
sharpness, 1.0; contrast, 0.8; color, 0.3; see also methods and materials). Results are 
shown in Figure 5 (A-F). 

PhaP, monitored as a PhaP::GFP fusion protein in pGM16.2 (Figures 5A and 5B), 
decreased significantly during the first half (2 hours) of lag phase growth, increased 
during late lag phase and early to mid-exponential phase, decreased in mid to late 
exponential phase and increased during stationary phase growth. A possible explanation 
for the rapid decrease of PhaP in lag phase is that PhaP may be a storage protein that is 
degraded as a source of amino acids. The profile of PHA accumulation in these cells 
(carrying pGM16.2) followed a similar pattern to that of PhaP except that PHA decreased 
only in the lag phase and continued to accumulate throughout other phases of culture 
growth. This data is consistent with PHA inclusion-bodies being a source of carbon, 
reducing equivalents and amino acids when the organism is first provided with fresh 
medium. Possible explanations as to why the level of PhaP and not PHA decreased at 
mid to late exponential phase are that either PhaP was synthesized at a slower rate than 
that of PHA, or PhaP was used as a source of amino acids at this phase of growth or both 
scenarios may apply. 

PhaC, monitored as a PhaC::GFP fusion protein in pGM13 showed a similar 
profile of expression to that of PhaP with one exception: PhaC did not reduce in level 
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during lag phase growth. It did, however, reduce in level in mid to late exponential phase 
growth, as did PhaP. The profile of PHA accumulation in these cells carrying PhaC::GFP 
was similar to that of cells carrying PhaP::GFP, except that the PHA level did not reduce 
during lag phase growth. The increased quantity of PhaC in the cell is a likely 
explanation since PhaC remained functional in the fusion protein PhaC::GFP. This was 
indicated by the fact that Escherichia coli DH5a (pC/GFP3) and Escherichia coli DH5a 
(pGM7) accumulated PHA to equivalent low levels, while the host strain alone, or 
carrying pGFPuv accumulated no PHA, as visualized by fluorescence microscopy of Nile 
Blue A stained cells. The reduction in level of PhaC in mid to late exponential phase, as 
was also seen with PhaP, is consistent with both PhaC and PhaP being synthesized at a 
slower rate than that of PHA. 

In cells of all growth phases, inclusion-bodies were rarely visible under light in 
stained heat fixed cells while larger inclusion-bodies were visible in phase contrast of 
living cells (Figure 5C-F). In older cultures (2 days and older) some cells were lysed, and 
showed PhaP::GFP and PhaC::GFP localized to free PHA inclusion-bodies (Figure 5D). 
Both free and intracellular inclusion-bodies had doughnut shaped localization of GFP at 
some focal planes while at other focal planes the same inclusion-bodies appeared 
completely covered in GFP. We interpret this data as a difference in quantity of GFP that 
is visible when viewed through the edge or the center of the inclusion-bodies. 

Example 14: Analysis of Bacillus megaterium 3-ketoacvl-Co A reductase PhaB 

Stereospecificity assays were conducted on the Bacillus megaterium reductase 
using various chain length enoyl-CoA esters (C4-C8, Table 6). The assay was done using 
crotonase from Sigma (L-hydroxy acids) or hydratase from Rhodosprillum rubrum (D- 
hydroxy acids) to form the 3-hydroxyacyl-CoA compounds from the enoyl-CoA esters. 
Acetoacetyl-CoA reductase activity was monitored spectrophotometrically as the 
reduction of NADP + while 3-hydroxyacyl-CoAs were oxidized. Based on the assay 
results (Table 6) the Bacillus megaterium reductase is a D-specific enzyme with a 
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preference for C6 carbon chains. Enzyme reactions using NADH as electron donor for 3- 
ketoacyl-CoA reduction did not indicate significant enzyme activity with this cofactor. 

Table 6: Analysis for stereo-specificity of the Bacillus megaterium 3-ketoacyl-CoA 

reductase. 



Hone # a 

V^lUllC ft 


T~)-"?tereoi somer 
(hydratase) 


Spec. act. 
U/mg 




Clone # 


L-stereoisomer 
(crotonase) 


Spec. act. 
U/mg 


Bl-30 


Crotonyl CoA 


0.155 




Bl-30 


Crotonyl CoA 


0.014 


Bl-30 


C5 


0.15 




Bl-30 


C5 


0.009 


Bl-30 


C6 


0.39 




Bl-30 


C6 


0.017 


Bl-30 


C8 


0.014 




Bl-30 


C8 


0.039 
















B5-20 


Crotonyl CoA 


0.077 




B5-20 


Crotonyl CoA 


0.004 


B5-20 


C5 


0.074 




B5-20 


C5 


0.01 


B5-20 


C6 


0.219 




B5-20 


C6 


0.012 


B5-20 


C8 


0.003 




B5-20 


C8 


0.001 
















Negative 


Crotonyl CoA 


0.02 




Negative 


Crotonyl CoA 


0.001 


Negative 


C5 


0.011 




Negative 


C5 


0.003 


Negative 


C6 


0.006 




Negative 


C6 


0.008 


Negative 


C8 


0.033 




Negative 


C8 


0.003 



a Clone Bl-30 contains pMON48213; clone B5-20 contains pMON48214. 



Example 15: Verification of the Bacillus megaterium 3-ketoac vl-CoA reductase for PHA 
accumulation 



The functionality of the Bacillus megaterium sequence for PHA accumulation in a 
recombinant system was assayed. Escherichia coli DH5a harboring either pMON48222 
(phaA Re , phaB Bm , phaCJ only, or two of the following plasmids: pJM9238 AAB (phaA 
and phaB deleted by Fsel digest and religation) or pJM9117 AAB (phaA and phaB 
deleted by Fsel digest and religation) and pMON48220 (phaA Re , phaB Bm ,) was grown in 
LB + mannitol in concentrations of 1 or 2 % (w/v), respectively. Cultures were induced 
for PHA accumulation at OD 500 = 0.6. Percentage PHA (Table 7) and enzyme activity 
(Table 8) were determined. Plasmid pMON48213 contains the same pha sequences as 
pMON48220, but was constructed with pSE380 (Invitrogen, Carlsbad, CA), a high level 
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expression vector. Plasmid pMON48221 contains the same pha sequences as 
pMON48220, but lacks a small fragment of the multicloning site between phaA Ke and 
phaB Bm . 

3-Ketoacyl-CoA reductase was monitored in a total volume of 1 mL containing 
100 mM potassium phosphate buffer pH 7.0, 50 uM acetoacetyl-CoA and 150 uM 
NADPH. The reaction mixture contained between 5 and 50 uL cell extract. Assays were 
monitored spectrophotometrically at 340 nm. 

Table 7: Application of the Bacillus megatehum 3-ketoacyl-CoA reductase for PHA 

formation in Escherichia coli 



Vectors 






% PHA 


Standard deviation 












pMON48222-4 






12.9 




pMON48222-8 






19.2 


















Average 


16.1 


±4.5 






















pJM9238 AAB 


pMON48220 


23.7 




pJM9238 AAB 


pMON48220 


18.9 


















Average 


21.3 


±3.4 












pJM9238 AAB 




Average 


1.5 


± 1.5 






















pJM9117 AAB 


pMOh 


148220 


12.5 




pJM9117 AAB 


pMON48220 


3.9 


















Average 


8.2 


±6.1 












pJM9117 AAB 




Average 


0.7 


±0.1 
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Table 8: Enzyme activity of the Bacillus megaterium 3-ketoacyl-CoA reductase using 

pMON48220 and pMON48213 



Vector 


acetoacetyl-CoA reductase[U/mg] 


Negative control 


0.08 






pMON48220-2 


0.24 




0.15 


pMON48220-9 


0.22 




0.23 


Average 


0.21 ±0.04 






pMON48213 


4.0 



Table 9: Verification of the Bacillus megaterium 3-ketoacyl-CoA reductase functionality 



E. coli DH5a containing plasmids 


Relevant genotype 


PHB content % CDW 


pJM9238AAB, pMON34610 


phaC Re 


nd 


pJM9238AAB, pMON34575 


phaC Re ,phaA Re 


1.2 ± 0.4 


pJM9238AAB, pMON48221 


phaC Re , phaA Re , phaB Bm 


22.2 ± 4.7 



nd - not detectable 



Example 16: Additional sequences in genomic fragment 

The 7,916 base pair genomic fragment (SEQ ID NO:l) additionally contained 
three complete open reading frames and one incomplete open reading frame encoding 
proteins in addition to PhaP, PhaQ, PhaR, PhaB, and PhaC. As indicated in Tables 3 and 
4, sequence comparisons suggest that ykoY (SEQ ID NO:22) encodes toxic anion 
resistance protein YkoY (SEQ ID NO:23), ykoZ (SEQ ID NO:24) encodes RNA 
polymerase sigma factor protein YkoZ (SEQ ID NO:25), and ybrM (SEQ ID NO:26) 
encodes a portion of the Na + -transporting ATP synthase protein YkrM (SEQ ID NO:27). 
Sequence sspD (SEQ ID NO:28) matches the known Bacillus megaterium sequence (4, 
10) encoding SspD (SEQ ID NO:29). While the activity of the proteins is identified by 
their similarity to other known proteins, it is possible that the proteins may have 
additional functionality involved in polyhydroxyalkanoate biosynthesis. 
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These nucleic acid and amino acid sequences may be used in nucleic acid 
segments, recombinant vectors, transgenic host cells, and transgenic plants. 

Example 17: One and two subunitPHA synthase proteins 

PHA synthases have been identified to be either one or two subunit enzymes (51). 
Single subunit enzymes have only the PhaC protein, while two subunit enzymes have 
PhaC and PhaE protein subunits. Nucleic acid sequences encoding PhaE subunits have 
been found to be located adjacent to the nucleic acid sequences encoding PhaC. 



Table 10: One and two subunit PHA synthases 



Source organism (Reference) 


Subunits 


PhaC Amino acids 


T. violacea (P45366, D48376) 


2 


355 


C. vinosum (P45370, S29274) 


2 


355 


T. pfennigii (WO 96/08566) 


2 


357 


Synechocystis sp. PCC6803 (50, 
D90906, S77327) 


2 


378 








P. oleovorans (22, A3 8604) 




559 


P. aeruginosa (S29305) 




559 


R. ruber (S25725) 




562 


R. eutropha (A34371) 




589 


A. caviae (D88825) 




594 


P. denitrificans (JC6023) 




624 


R. etli(3, U30612) 




636 








B. megaterium (SEQ ID NO:l 1) 




362 



Based on the number of amino acids in the deduced sequence and homology to 
known PhaC proteins, the B. megaterium would be expected to be part of a two subunit 
synthase. However, the nucleic acid sequences adjacent to phaC in the 7,916 base pair 
genomic fragment show no significant similarity to a phaE sequence. Upstream of phaC 
is phaB, and downstream is ykrM, a suspected Na + transporting ATP synthase (Table 4). 
In combination with the observation that the B. megaterium sequences were able to 
complement P. putida GPpl04 to accumulate PHA, this suggests that the B. megaterium 
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phaC may encode a novel class of PHA synthase, i.e. a single subunit synthase with a 
molecular weight in the range of two subunit PhaC proteins. 

Example 18: Pathway for the production of C4/C6/C8/C 10 PHA copolymers 

Figure 10 outlines a proposed biosynthetic pathway for the production of PHA 
copolymers incorporating C4 and/or C6 monomer units. Produced polymers may include 
C4-CO-C6, C4-co-C8, C4-co-C6-co-C8, C6-CO-C8, C6, and C8. A recombinant host cell 
or plant may be constructed to contain the nucleic acid sequences encoding the required 
enzymes. 

The p-ketothiolase is preferably BktB (53, WO 98/00557). The p-ketothiolase 
can condense two molecules of acetyl-CoA to form acetoacetyl-CoA. This product may 
be reduced to 3HB-CoA by the Bacillus megaterium 3-keto-acyl-CoA reductase protein. 
3HB-CoA may be converted to crotonyl-CoA by a hydratase such as that from 
Aeromonas caviae (54). Subsequent reduction to butyryl-CoA is performed by a butyryl- 
CoA dehydrogenase such as that cloned from Clostridium acetobutylicum (55). This 
product may be condensed with acetyl-CoA by the p-ketothiolase to afford 3- 
ketohexanoyl-CoA. This is the preferred substrate of the Bacillus megaterium reductase, 
leading to the production of 3-hydroxyhexanoyl-CoA. This product may be incorporated 
into C6 polymers or copolymers (e.g. C4-CO-C6) by a PHA synthase having a broad 
substrate specificity (e.g. (56)). An additional round of condensation may lead to 
production of the C8 monomer, allowing the introduction of C8 into PHA polymers or 
copolymers. A further additional round of condensation may lead to production of the 
CIO monomer, allowing the introduction of CIO into PHA polymers or copolymers. 

Example 19: Nucleic acid mutation and hybridization 

Variations in the nucleic acid sequence encoding a protein may lead to mutant 
protein sequences that display equivalent or superior enzymatic characteristics when 
compared to the sequences disclosed herein. This invention accordingly encompasses 
nucleic acid sequences which are similar to the sequences disclosed herein, protein 
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sequences which are similar to the sequences disclosed herein, and the nucleic acid 
sequences that encode them. Mutations may include deletions, insertions, truncations, 
substitutions, fusions, shuffling of subunit sequences, and the like. 

Mutations to a nucleic acid sequence may be introduced in either a specific or 
random manner, both of which are well known to those of skill in the art of molecular 
biology. A myriad of site-directed mutagenesis techniques exist, typically using 
oligonucleotides to introduce mutations at specific locations in a nucleic acid sequence. 
Examples include single strand rescue (Kunkel, T. Proc. Natl. Acad. Sci. U.S.A., 82: 488- 
492, 1985), unique site elimination (Deng and Nickloff, Anal. Biochem. 200: 81, 1992), 
nick protection (Vandeyar, et al. Gene 65: 129-133, 1988), and PCR (Costa, et al. 
Methods Mol. Biol. 57: 31-44, 1996). Random or non-specific mutations may be 
generated by chemical agents (for a general review, see Singer and Kusmierek, Ann. Rev. 
Biochem. 52: 655-693, 1982) such as nitrosoguanidine (Cerda-Olmedo et al., J. Mol. Biol. 
33: 705-719, 1968; Guerola, et al. Nature New Biol. 230: 122-125, 1971) and 2- 
aminopurine (Rogan and Bessman, J. Bacterial. 103: 622-633, 1970), or by biological 
methods such as passage through mutator strains (Greener et al. Mol. Biotechnol. 7: 189- 
195, 1997). 

Nucleic acid hybridization is a technique well known to those of skill in the art of 
DNA manipulation. The hybridization properties of a given pair of nucleic acids is an 
indication of their similarity or identity. Mutated nucleic acid sequences may be selected 
for their similarity to the disclosed nucleic acid sequences on the basis of their 
hybridization to the disclosed sequences. Low stringency conditions may be used to 
select sequences with multiple mutations. One may wish to employ conditions such as 
about 0.15 M to about 0.9 M sodium chloride, at temperatures ranging from about 20°C 
to about 55°C. High stringency conditions may be used to select for nucleic acid 
sequences with higher degrees of identity to the disclosed sequences. Conditions 
employed may include about 0.02 M to about 0.15 M sodium chloride, about 0.5% to 
about 5% casein, about 0.02% SDS and/or about 0.1% N-laurylsarcosine, about 0.001 M 
to about 0.03 M sodium citrate, at temperatures between about 50°C and about 70°C. 
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More preferably, high stringency conditions are 0.02 M sodium chloride, 0.5% casein, 
0.02% SDS, 0.001 M sodium citrate, at a temperature of 50°C. 

Example 20: Determination of homologous and degenerate n ucleic acid sequences 

Modification and changes may be made in the sequence of the proteins of the 
present invention and the nucleic acid segments which encode them and still obtain a 
functional molecule that encodes a protein with desirable properties. The following is a 
discussion based upon changing the amino acid sequence of a protein to create an 
equivalent, or possibly an improved, second-generation molecule. The amino acid 
changes may be achieved by changing the codons of the nucleic acid sequence, according 
to the codons given in Table 11. 



Table 1 1 : Codon degeneracies of amino acids 



Amino acid 


One letter 


Three letter 


Codons 


Alanine 


A 


Ala 


GCA GCC GCG GCT 


Cysteine 


C 


Cys 


TGC TGT 


Aspartic acid 


D 


Asp 


GAC GAT 


Glutamic acid 


E 


Glu 


GAA GAG 


Phenylalanine 


F 


Phe 


TTC TTT 


Glycine 


G 


Gly 


GGA GGC GGG GGT 


Histidine 


H 


His 


CAC CAT 


Isoleucine 


I 


He 


ATA ATC ATT 


Lysine 


K 


Lys 


AAA AAG 


Leucine 


L 


Leu 


TTA TTG CTA CTC CTG CTT 


Methionine 


M 


Met 


ATG 


Asparagine 


N 


Asn 


AAC AAT 


Proline 


P 


Pro 


CCA CCC CCG CCT 


Glutamine 


Q 


Gin 


CAA CAG 


Arginine 


R 


Arg 


AGA AGG CGA CGC CGG CGT 


Serine 


S 


Ser 


AGC AGT TCA TCC TCG TCT 


Threonine 


T 


Thr 


ACA ACC ACG ACT 


Valine 


V 


Val 


GTA GTC GTGGTT 


Tryptophan 


W 


Tip 


TGG 


Tyrosine 


Y 


Tyr 


TAC TAT 
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Certain amino acids may be substituted for other amino acids in a protein 
sequence without appreciable loss of enzymatic activity. It is thus contemplated that 
various changes may be made in the peptide sequences of the disclosed protein 
sequences, or their corresponding nucleic acid sequences without appreciable loss of the 
biological activity. 

In making such changes, the hydropathic index of amino acids may be considered. 
The importance of the hydropathic amino acid index in conferring interactive biological 
function on a protein is generally understood in the art (Kyte and Doolittle, J. Mol. Biol, 
157: 105-132, 1982). It is accepted that the relative hydropathic character of the amino 
acid contributes to the secondary structure of the resultant protein, which in turn defines 
the interaction of the protein with other molecules, for example, enzymes, substrates, 
receptors, DNA, antibodies, antigens, and the like. 

Each amino acid has been assigned a hydropathic index on the basis of their 
hydrophobicity and charge characteristics. These are: isoleucine (+4.5); valine (+4.2); 
leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine 
(+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); 
proline (-1.6); histidine (-3.2); glutamate/glutamine/aspartate/asparagine (-3.5); lysine (- 
3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other amino 
acids having a similar hydropathic index or score and still result in a protein with similar 
biological activity, i.e., still obtain a biologically functional protein. In making such 
changes, the substitution of amino acids whose hydropathic indices are within ±2 is 
preferred, those within ±1 are more preferred, and those within ±0.5 are most preferred. 

It is also understood in the art that the substitution of like amino acids may be 
made effectively on the basis of hydrophilicity. U.S. Patent No. 4,554,101 (Hopp, T.P., 
issued November 19, 1985) states that the greatest local average hydrophilicity of a 
protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with a 
biological property of the protein. The following hydrophilicity values have been 
assigned to amino acids: arginine/lysine (+3.0); aspartate/glutamate (+3.0 ±1); serine 
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(+0.3); asparagine/glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ±1); 
alanine/histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); 
leucine/isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); and tryptophan (-3.4). 

It is understood that an amino acid may be substituted by another amino acid 
having a similar hydrophilicity score and still result in a protein with similar biological 
activity, i.e., still obtain a biologically functional protein. In making such changes, the 
substitution of amino acids whose hydropathic indices are within +2 is preferred, those 
within ±1 are more preferred, and those within ±0.5 are most preferred. 

As outlined above, amino acid substitutions are therefore based on the relative 
similarity of the amino acid side-chain substituents, for example, their hydrophobicity, 
hydrophilicity, charge, size, and the like. Exemplary substitutions which take various of 
the foregoing characteristics into consideration are well known to those of skill in the art 
and include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine 
and asparagine; and valine, leucine, and isoleucine. Changes which are not expected to 
be advantageous may also be used if these resulted in functional fusion proteins. 

Plant Vectors 

In plants, transformation vectors capable of introducing nucleic acid sequences 
encoding polyhydroxyalkanoate biosynthesis enzymes are easily designed, and generally 
contain one or more nucleic acid coding sequences of interest under the transcriptional 
control of 5' and 3' regulatory sequences. Such vectors generally comprise, operatively 
linked in sequence in the 5' to 3' direction, a promoter sequence that directs the 
transcription of a downstream heterologous structural nucleic acid sequence in a plant; 
optionally, a 5' non-translated leader sequence; a nucleic acid sequence that encodes a 
protein of interest; and a 3' non-translated region that encodes a polyadenylation signal 
which functions in plant cells to cause the termination of transcription and the addition of 
polyadenylate nucleotides to the 3' end of the mRNA encoding the protein. Plant 
transformation vectors also generally contain a selectable marker. Typical 5 '-3' 
regulatory sequences include a transcription initiation start site, a ribosome binding site, 
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an RNA processing signal, a transcription termination site, and/or a polyadenylation 
signal. Vectors for plant transformation have been reviewed in Rodriguez et al. (Vectors: 
A Survey of Molecular Cloning Vectors and Their Uses, Butterworths, Boston., 1988), 
Glick et al. (Methods in Plant Molecular Biology and Biotechnology, CRC Press, Boca 
Raton, Fla., 1993), and Croy (Plant Molecular Biology Labfax, Hames and Rickwood 
(Eds.), BIOS Scientific Publishers Limited, Oxford, UK., 1993). 

Plant Promoters 

Plant promoter sequences can be constitutive or inducible, environmentally- or 
developmentally-regulated, or cell- or tissue-specific. Often-used constitutive promoters 
include the CaMV 35S promoter (Odell, J.T. et al., Nature 313: 810-812, 1985), the 
enhanced CaMV 35S promoter, the Figwort Mosaic Virus (FMV) promoter (Richins et 
al., Nucleic Acids Res. 20: 8451-8466, 1987), the mannopine synthase (mas) promoter, 
the nopaline synthase (nos) promoter, and the octopine synthase (ocs) promoter. Useful 
inducible promoters include promoters induced by salicylic acid or polyacrylic acids (PR- 
1, Williams , S. W. et al, Biotechnology 10: 540-543, 1992), induced by application of 
safeners (substituted benzenesulfonamide herbicides, Hershey, H.P. and Stoner, T.D., 
Plant Mol. Biol. 17: 679-690, 1991), heat-shock promoters (Ou-Lee et al., Proc. Natl. 
Acad. Sci U.S.A. 83: 6815-6819, 1986; Ainley et al., Plant Mol. Biol. 14: 949-967, 1990), 
a nitrate-inducible promoter derived from the spinach nitrite reductase gene (Back et al., 
Plant Mol. Biol. 17: 9-18, 1991), hormone-inducible promoters (Yamaguchi-Shinozaki, 
K. et al., Plant Mol. Biol. 15: 905-912, 1990; Kares et al., Plant Mol. Biol. 15: 225-236, 
1990), and light-inducible promoters associated with the small subunit of RuBP 
carboxylase and LHCP gene families (Kuhlemeier et al., Plant Cell 1: 471, 1989; 
Feinbaum, R.L. et al., Mol. Gen. Genet. 226: 449-456, 1991; Weisshaar, B. et al., EMBO 
J. 10: 1777-1786, 1991; Lam, E. and Chua, N.H., J. Biol. Chem. 266: 17131-17135, 
1990; Castresana, C. et al., EMBO J. 7: 1929-1936, 1988; Schulze-Lefert et al., EMBO J. 
8: 651, 1989). Examples of useful tissue-specific, developmentally-regulated promoters 
include the p-conglycinin 7S promoter (Doyle, J.J. et al., J. Biol. Chem. 261: 9228-9238, 
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1986; Slighton and Beachy, Planta 172: 356-363, 1987), and seed-specific promoters 
(Knutzon, D.S. et al., Proc. Natl. Acad. Sci U.S.A. 89: 2624-2628, 1992; Bustos, M.M. et 
al, EMBOJ. 10: 1469-1479, 1991; Lam and Chua, Science 248: 471, 1991; Stayton et 
al., Aust J. Plant. Physiol 18: 507, 1991). Plant functional promoters useful for 
preferential expression in seed plastids include those from plant storage protein genes and 
from genes involved in fatty acid biosynthesis in oilseeds. Examples of such promoters 
include the 5' regulatory regions from such genes as napin (Kridl et al., Seed Sci. Res. 1: 
209-219, 1991), phaseolin, zein, soybean trypsin inhibitor, ACP, stearoyl-ACP 
desaturase, and oleosin. Seed-specific gene regulation is discussed in EP 0 255 378. 
Promoter hybrids can also be constructed to enhance transcriptional activity (Comai, L. 
and Moran, P.M., U.S. Patent No. 5,106,739, issued April 21, 1992), or to combine 
desired transcriptional activity and tissue specificity. A developing seed selective, 
promoter may be obtained from the fatty acid hydroxylase gene of Lesquerella (P-lh) 
(Broun, P. and C. Somerville. Plant Physiol. 1 13: 933-942, 1997). 

Plant transformation and regeneration 

A variety of different methods can be employed to introduce such vectors into 
plant protoplasts, cells, callus tissue, leaf discs, meristems, etcetera, to generate 
transgenic plants, including Agrobacterium-mQdixdXQd transformation, particle gun 
delivery, microinjection, electroporation, polyethylene glycolmediated protoplast 
transformation, liposome-mediated transformation, etcetera (reviewed in Potrykus, I. 
Ann. Rev. Plant Physiol Plant Mol Biol 42: 205-225, 1991). In general, transgenic 
plants comprising cells containing and expressing DNAs encoding polyhydroxyalkanoate 
biosynthesis proteins can be produced by transforming plant cells with a DNA construct 
as described above via any of the foregoing methods; selecting plant cells that have been 
transformed on a selective medium; regenerating plant cells that have been transformed to 
produce differentiated plants; and selecting a transformed plant which expresses the 
protein-encoding nucleotide sequence. 
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Specific methods for transforming a wide variety of dicots and obtaining 
transgenic plants are well documented in the literature (Gasser and Fraley, Science 244: 
1293-1299, 1989; Fisk and Dandekar, Scientia Horticulturae 55: 5-36, 1993; Christou, 
Agro Food Industry Hi Tech, p. 17, 1994; and the references cited therein). 

Successful transformation and plant regeneration have been reported in the 
monocots as follows: asparagus (Asparagus officinalis; Bytebier et al., Proc. Natl. Acad. 
Sci. U.S.A. 84: 5345-5349, 1987); barley (Hordeum vulgarae; Wan and Lemaux, Plant 
Physiol. 104: 37-48, 1994); maize (Zea mays; Rhodes, C.A. et al., Science 240: 204-207, 
1988; Gordon-Kamm et al., Plant Cell 2: 603-618, 1990; Fromm, M.E. et al., 
Bio/Technology 8: 833-839, 1990; Koziel et al., Bio/Technology 11: 194-200, 1993); oats 
(Avena sativa; Somers et al., Bio/Technology 10: 1589-1594, 1992); orchardgrass 
(Dactylis glomerata; Horn et al., Plant Cell Rep. 7: 469-472, 1988); rice (Oryza sativa, 
including indica and japonica varieties; Toriyama et al., Bio/Technology 6: 10, 1988; 
Zhang et al., Plant Cell Rep. 7: 379-384, 1988; Luo and Wu, Plant Mol. Biol. Rep. 6: 
165-174, 1988; Zhang and Wu, Theor. Appl. Genet. 76: 835-840, 1988; Christou et al., 
Bio/Technology 9: 957-962, 1991); rye (Secale cereale; De la Pena et al., Nature 325: 
274-276, 1987); sorghum (Sorghum bicolor; Casas, A.M. et al., Proc. Natl. Acad. Sci. 
U.S.A. 90: 11212-11216, 1993); sugar cane (Saccharum spp.; Bower and Birch, Plant J. 
2: 409-416, 1992); tall fescue (Festuca arundinacea; Wang, Z.Y. et al., Bio/Technology 
10: 691-696, 1992); turfgrass (Agrostis palustris; Zhong et al., Plant Cell Rep. 13: 1-6, 
1993); wheat (Triticum aestivum; Vasil et al, Bio/Technology 10: 667-674, 1992; Weeks, 
T. et al., Plant Physiol. 102: 1077-1084, 1993; Becker et al., Plant J. 5: 299-307, 1994), 
and alfalfa (Masoud, S.A. et al., Transgen. Res. 5: 313, 1996). 

Host plants 

Particularly useful plants for polyhydroxyalkanoate production include those that 
produce carbon substrates, including tobacco, wheat, potato, Arabidopsis, and high oil 
seed plants such as corn, soybean, canola, oil seed rape, sugarbeet, sunflower, flax, 
peanut, sugarcane, switchgrass, and alfalfa. 
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Example 21 : Plastid transformation 

Alternatively, polyhydroxyalkanoate biosynthesis enzymes facilitating the 
increase in oil content of plants and/or herbicide resistance discussed herein can be 
expressed in situ in plastids by direct transformation of these organelles with appropriate 
recombinant expression constructs. Constructs and methods for stably transforming 
plastids of higher plants are well known in the art (Svab, Z. et al., Plant Mol Biol 14(2): 
197-205, 1990; Svab et al., Proc. Natl Acad. Sci. USA. 90(3): 913-917, 1993; Staub et 
al., EMBO 7. 12(2): 601-606, 1993; Maliga et al., U.S. Patent No. 5,451,513; PCT 
International Publications WO 95/16783, WO 95/24492, and WO 95/24493). These 
methods generally rely on particle gun delivery of DNA containing a selectable or 
scorable marker in addition to introduced DNA sequences for expression, and targeting of 
the DNA to the plastid genome through homologous recombination. Transformation of a 
wide variety of different monocots and dicots by particle gun bombardment is routine in 
the art (Hinchee et al., 1994; Walden and Wingender, 1995). The plastid may be 
transformed by using protoplast and PEG (polyethylene glycol) (Koop, et al., Physiol 
Plant. 85: 339, 1992; Golds et al., Bio/Technol. 11: 95-97, 1993), cocultivation of 
protoplasts and Agrobacteria carrying transformation vectors (De Block et al., EMBO J. 
4: 1367-1372, 1985), and by electroporation (Kin-Ying et al., Plant J. 4: 737, 1996). 

Nucleic acid constructs for plastid transformation generally comprise a targeting 
segement comprising flanking nucleic acid sequences substantially homologous to a 
predetermined sequence of a plastid genome, which targeting segment enables insertion 
of nucleic acid coding sequences of interest into the plastid genome by homologous 
recombination with the predetermined sequence; a selectable marker sequence, such as a 
sequence encoding a form of plastid 16S ribosomal RNA that is resistant to 
spectinomycin or streptomycin, or that encodes a protein which inactivates spectinomycin 
or streptomycin (such as the aadA gene), disposed within the targeting segment, wherein 
the selectable marker sequence confers a selectable phenotype upon plant cells, 
substantially all the plastids of which have been transformed with the nucleic acid 
construct; and one or more nucleic acid coding sequences of interest disposed within the 
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targeting segment relative to the selectable marker sequence so as not to interfere with 
conferring of the selectable phenotype. In addition, plastid expression constructs also 
generally include a plastid promoter region and a transcription termination region capable 
of terminating transcription in a plant plastid, wherein the regions are operatively linked 
to the nucleic acid coding sequences of interest. 

A further refinement in chloroplast transformation/expression technology that 
facilitates control over the timing and tissue pattern of expression of introduced nucleic 
acid coding sequences in plant plastid genomes has been described in PCT International 
Publication WO 95/16783. This method involves the introduction into plant cells of 
constructs for nuclear transformation that provide for the expression of a viral single 
subunit RNA polymerase and targeting of this polymerase into the plastids via fusion to a 
plastid transit peptide. Transformation of plastids with nucleic acid constructs 
comprising a viral single subunit RNA polymerase-specific promoter specific to the RNA 
polymerase expressed from the nuclear expression constructs operably linked to nucleic 
acid coding sequences of interest permits control of the plastid expression constructs in a 
tissue and/or developmental specific manner in plants comprising both the nuclear 
polymerase construct and the plastid expression constructs. Expression of the nuclear 
RNA polymerase coding sequence can be placed under the control of either a constitutive 
promoter, or a tissue- or developmental stage-specific promoter, thereby extending this 
control to the plastid expression construct responsive to the plastid-targeted, nuclear- 
encoded viral RNA polymerase. The introduced nucleic acid coding sequence can be a 
single encoding region, or may contain a number of consecutive encoding sequences to 
be expressed as an engineered or synthetic operon. The latter is especially attractive 
where, as in the present invention, it is desired to introduce multigene biochemical 
pathways into plastids. This approach is more complex using standard nuclear 
transformation techniques since each gene introduced therein must be engineered as a 
monocistron, including an encoded transit peptide and appropriate promoter and 
terminator signals. Individual gene expression levels may vary widely among different 
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cistrons, thereby possibly adversely affecting the overall biosynthetic process. This can 
be avoided by the chloroplast transformation approach. 

All of the compositions and/or methods disclosed and claimed herein can be made 
and executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied 
to the compositions and/or methods and in the steps or in the sequence of steps of the 
methods described herein without departing from the concept, spirit and scope of the 
invention. More specifically, it will be apparent that certain agents which are both 
chemically and physiologically related may be substituted for the agents described herein 
while the same or similar results would be achieved. All such similar substitutes and 
modifications apparent to those skilled in the art are deemed to be within the spirit, scope 
and concept of the invention. 
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